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In  a  two-sample  situation,  a  two-parameter  mixture  model  is  postulated  for  the 
treatment  group  where  a  proportion  n  of  the  subjects  are  affected  favorably  by  the 
treatment,  and  the  remaining  proportion  of  nonresponders  behave  like  the  control 
patients.  The  response  of  interest  is  subject  to  random  censoring. 

A  linear  rank  statistic  is  developed  that  has  scores  that  are  derived  from  the 
probability  of  the  underlying  rank  vector  arising  from  data  governed  by  the  mixture 
model.  The  technique  used  to  generate  the  scores  is  one  that  emphasizes  optimality 
properties  for  values  of  the  parameter  tt  in  the  neighborhood  of  zero.  Writing  the 
linear  rank  statistic  as  a  stochastic  integral  leads  to  a  large  sample  test  procedure. 
The  performance  of  the  test  procedure  is  compared  to  standard  rank-based  tests 
in  terms  of  empirical  power  for  a  variety  of  underlying  distributions,  censoring 
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patterns,  and  moderate  to  large  sample  sizes.  Comparisons  via  asymptotic  relative 
efficiency  and  real  data  examples  are  presented  as  well. 

Simulation  studies  indicate  that  the  mixture  model-based  test  procedures  are 
more  efficient  than  the  standard  tests  in  detecting  improvement  due  to  treatment 
when  the  proportion  of  responders  is  small  and  the  improvement  is  substantial. 
Because  the  mixed  model  test  statistic  depends  on  an  unknown  parameter,  the 
simulations  also  provide  some  guidance  in  choosing  a  parameter  value  that  provides 
a  test  that  is  both  sensitive  to  detect  a  true  treatment  effect  and  is  valid  in  terms 
of  the  chosen  significance  level. 
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CHAPTER  1 
INTRODUCTION 


1-1     The  Nonresponders  Problem 


The  "nonresponders"  problem  may  arise  in  studies  designed  to  evaluate  the 
efficacy  and/or  toxicity  of  new  drugs.  Such  studies  generate  data  from  two  groups— 
the  Treatment  group  consisting  of  responses  to  the  drug  and  the  Control  group 
consisting  of  responses  to  a  placebo.  In  cases  where  the  drug  is  designed  to  combat 
one  of  several  different  factors  that  can  be  the  chief  cause  of  the  disease,  a  portion 
of  the  treated  subjects  will  respond  like  the  control  subjects.  In  other  words,  this 
portion  of  subjects  will  not  respond  to  the  treatment,  and  the  response  of  interest 
can  be  regarded  as  an  observation  that  would  occur  had  the  subject  been  a  member 
of  the  control  group.  The  subjects  are  denned  to  be  nonresponders  to  treatment. 

Salsburg  (1986)  presents  an  excellent  discussion  of  the  nonresponders  problem. 
He  notes  that  many  prospective  drug  compounds  are  designed  to  combat  a  particular 
cause  of  disease  that  can  be  successfully  identified  and  targeted.  One  example 
cited  by  Salsburg  (1986)  is  an  enzyme-inhibitor  compound  for  emphysema.  Such 
a  compound  shuts  down  the  activity  of  a  particular  proteolytic  enzyme  that  is 
known  to  be  a  major  factor  in  the  presence  of  the  disease.  Other  patients  with 
emphysema  may  not  have  the  enzyme  as  a  primary  agent,  so  that  the  compound 
will  consequently  not  affect  the  emphysema. 


In  a  similar  vein,  serum  activity  of  certain  enzymes  increase  when  the  disease 
of  hepatitis  is  present.  Medical  researchers  may  try  to  isolate  the  enzyme(s)  and 
investigate  whether  a  lowering  of  the  serum  enzyme  level(s)  will  result  in  amelio- 
ration of  the  hepatitis  for  patients.  We  may  be  able  only  to  determine  a  specific 
enzyme  as  a  major  risk  factor  by  treating  a  patient  with  a  compound  designed  to 
specifically  act  on  serum  enzyme  levels.  In  cases  where  the  enzyme  is  the  cause  of 
the  disease,  the  patient  will  respond,  in  other  cases  he/she  will  not.  There  is  no 
external  mechanism  to  determine  whether  or  not  a  patient  is  a  responder  to  the 
treatment. 

Besides  enzyme-inhibitors,  Salsburg  (1986)  mentions  receptor  site  agonizers  and 
hormone-mimickers  as  prospective  compounds  to  combat  conditions  such  as  con- 
gestive heart  failure  and  arthritis.  Clinical  investigations  of  such  compounds  will 
encounter  the  same  nonresponders  problem.  Boos  and  Brownie  (1986)  mention 
the  areas  of  nutritional  supplementation  and  behavioral  toxicology  where  the  as- 
sumption of  the  existence  of  nonresponders  is  reasonable.  The  use  of  antibiotics  on 
rats  infected  with  the  Herpes  virus  served  as  an  illustration  of  the  nonresponders 
phenomenon  for  Good  (1979). 

Johnson,  Verill,  and  Moore  (1987)  reported  on  a  study  by  Carano  and  Moore 
(1982)  in  which  subject-to-subject  variation  of  complex  body  mechanisms  for  deal- 
ing with  medications  resulted  in  only  a  subset  of  subjects  reacting  to  treatment. 
Carano  and  Moore  (1982)  were  interested  in  studying  Sister  Chromatid  Exchanges 
(SCE)  as  an  indicator  of  carcinogenicity  in  human  smokers.  A  larger  frequency  of 


high  SCE  values  per  cell  appeared  in  many  subjects  who  were  smokers,  but  subject- 
to-subject  variation  was  highly  prevalent  and  made  it  difficult  to  detect  a  difference 
in  SCE  distributions  for  smokers  and  nonsmokers.  The  damage  in  cells  was  de- 
scribed as  persistent  although  only  occuring  in  a  small  fraction  of  them.  Data  for 
a  similar  situation  where  chemotherapy  is  a  toxic  substance  that  is  hypothesized 
to  increase  SCE  counts  in  some  cells  is  given  in  the  Johnson  et  al.  (1987)  article. 
The  counts  for  a  sample  of  cells  from  a  patient  are  measured  before  undergoing  the 
therapy  and  again  for  a  second  sample  taken  after  administration. 

The  examples  of  the  nonresponders  phenomenon  described  thus  far,  particularly 
the  last  one  of  chemotherapy,  involve  data  that  are  completely  observable.  A  nat- 
ural extension  is  the  scenario  in  which  some  of  the  observations  are  incomplete,  or 
censored.  This  is  a  feature  for  responses  of  the  time-to-event  nature,  such  as  time- 
to-relapse,  time-to-death,  time-to-recurrence,  etc.  A  subject's  observation  may  only 
be  known  to  exceed  some  value.  In  medical  studies  this  aspect  of  censoring  produces 
what  is  called  survival  data. 

A  clinical  trial  of  cancer  patients  who  receive  one  of  two  therapies  will  involve 
data  of  the  time-to-event  nature  with  the  feature  of  censoring.  Patients  receiving 
conventional  radiation  therapy  are  considered  the  "controls,"  while  other  patients 
receiving  a  combination  of  radiation  and  chemotherapy  make  up  the  treatment 
group.  Conceivably,  there  may  be  patients  where  the  chemotherapy  has  no  effect, 
and  those  treated  nonresponder  patients  will  have  survival  times  like  the  control 
patients.   Clinical  trials  invariably  produce  patients  surviving  at  time  of  analysis, 


or  perhaps  patients  "lost  to  follow-up"  by  reason  of  moving  away,  or  death  due  to 
a  cause  unrelated  to  treatment.  Such  patients  will  have  censored  times. 

Other  medical  studies  attempt  to  assess  the  effects  of  transplanted  organs  on 
survival.  The  human  body's  tendency  to  reject  foreign  tissue  may  render  the  trans- 
plantation ineffective,  and  the  individual  dies  just  as  if  no  transplant  was  received. 

1.2     Overview  of  this  Manuscript 

The  presence  of  nonresponders  in  the  treatment  group  is  considered  in  this 
dissertation.  The  potential  of  censoring  on  the  collected  data  is  taken  into  account. 
Standard  techniques  as  well  as  newly  developed  methods  for  detecting  a  treatment 
effect  in  the  presence  of  nonresponders  and  censored  data  are  investigated. 

Chapter  Two  presents  statistical  formulation  of  the  nonresponder  problem.  A 
model  to  describe  this  situation  is  given.  Techniques  put  forth  in  the  statistical 
literature  for  the  nonresponder  problem  are  reviewed.  These  techniques  consider 
uncensored  data.  Chapter  Three  contains  the  development  of  methods  for  censored 
data.  Specific  forms  for  these  methods  under  chosen  error  distributions  are  given 
in  Chapter  Four.  Efficiencies,  simulations,  and  illustrations  on  real  data  in  order  to 
compare  new  methods  with  the  standard  methods  make  up  Chapter  Five.  Summary 
and  conclusions  are  in  Chapter  Six. 

The  referencing  system  in  this  manuscript  will  follow  the  convention  of  number- 
ing equations  within  section.  Tables  and  figures  will  be  numbered  within  chapter. 
Theorems,  Corollaries,  and  Lemmas  are  counted  within  section  and  are  numbered 


independently  of  one  another.  For  example,  the  first  theorem  in  Section  2.3  is  Theo- 
rem 2.3.1,  and  the  first  lemma  in  Section  2.3  is  Lemma  2.3.1.  The  second  referenced 
equation  in  that  section  is  (2.3.2). 


CHAPTER  2 
STATISTICAL  FORMULATION  OF  THE  NONRESPONDERS  PROBLEM 


2.1     Introduction 

Statistical  models  for  the  nonresponders  problem  have  been  consistently  written 
in  terms  of  cumulative  distribution  functions  in  the  literature.  To  accomodate  the 
prevailing  convention  in  the  treatment  of  censored  data,  we  shall  present  our  models 
in  terms  of  survival  functions. 

In  Section  2.2,  models  suitable  for  statistical  treatment  of  the  nonresponders 
problem  are  formally  presented  along  with  notations  defined  for  right-censored  data. 
A  review  of  statistical  research  literature  pertinent  to  the  modelling  for  the  nonre- 
sponders problem  is  contained  in  Section  2.3. 

2.2     The  Mixture  MoHpI 

Let  X0  denote  the  response  variable  associated  with  members  in  the  control 
group,  and  let  Xt  denote  the  response  variable  for  those  subjects  in  the  treatment 
group.  Call  R  the  indicator  variable  which  delineates  whether  or  not  a  treated 
subject  is  a  responder,  that  is, 

{1,     if  the  treated  subject  is  a  responder; 
0,     if  the  treated  subject  is  a  nonresponder. 


Then  the  true  proportion  of  responders  in  the  treated  population  is  given  by 

7T  =  Pr(R  =  1).  (2.2.1) 

The  distribution  of  X\,  the  response  for  a  treated  subject,  can  be  represented  as 
a  mixture  of  two  conditional  distributions — one  the  distribution  of  the  response  for  a 
responder  subject  and  the  other  the  distribution  of  the  response  for  a  nonresponder 
subject.  Let  the  associated  survival  functions  for  the  two  conditional  distributions 
be  denoted  by 

Fr(x)  =  Pr(*x  >  x  |  R  =  1);         FNR(x)  =  Pr(X,  >  x  |  R  =  0), 

respectively.  Then  with  7r  as  in  (2.2.1), 

G(x)    =    Pr(*,  >  x) 

=    Pr(*,  >  x  |  R  =  l)Pr(i?  =  1)  +  pT(X1  >x\R  =  0)Pr(R  =  0) 
=    vFR(x)  +  (1  -  ir)FNR(x)  (2.2.2) 

is  the  survival  function  associated  with  the  distribution  of  Xx. 

The  following  assumptions  are  made  throughout  the  remainder  of  this  disserta- 
tion. 

Al.  The  distribution  of  the  measured  value  for  a  subject  in  the  control  group 
is  the  same  as  the  distribution  of  a  measured  value  for  a  nonresponding 
subject  in  the  treatment  group.  That  is, 

F(x)  =  Pv(X0  >x)  =  FNR(x). 


A2.  The  measured  value  for  a  responding  subject  in  the  treated  group  is 
stochastically  larger  than  the  measured  value  for  a  treated  nonresponder. 
That  is, 

Fr{x)  >  Fnr(x)     for  all  x  with  strict  inequality  for  at  least  one  x. 

A3.  Both  Fr  and  F^r  are  members  of  a  family  of  survival  functions  indexed 
by  a  real  parameter  A: 

T={F{x:  A):0<A<+oo},  (2.2.3) 

where  F(x  :  A)  is  continuous  with  density  f(x  :  A)  and  is  such  that 

F{x  :  0)  =  FNR(x)  =  F(x). 

Before  proceeding  further,  some  comments  concerning  the  three  assumptions 
are  in  order.  As  noted  before,  Al  is  a  reasonable  assumption  in  many  practical 
settings.  It  implies  that  nonresponding  subjects  in  the  treated  group  behave  like 
the  subjects  in  the  control  group.  The  second  assumption,  A2,  can  be  interpreted 
as  implying  that  the  treatment  effect,  if  present,  will  increase  the  probability  that 
the  measured  value  for  a  responding  subject  will  be  larger  than  a  given  value.  Of 
course,  the  methods  that  are  developed  under  A2  can  be  modified  to  the  case  where 
A2  is  replaced  by  the  assumption 

A2'.  the  measured  value  for  a  responding  subject  in  the  treated  group  is 
stochastically  smaller  than  the  measured  value  for  a  treated  nonrespon- 
der. That  is, 

Fr{x)  <  FNR(x)     for  all  x  with  strict  inequality  for  at  least  one  x. 


Assumption  A3,  and  particularly  (2.2.3),  when  used  in  the  mixture  model  at 
(2.2.2),  implies  that  the  survival  function  for  an  experimental  subject  has  the  form 

G{x:tt,A)  =  ttF(x:  A)  +  (1  -  x)F(x),         0  <  x  <  1,     0<A<oo,      (2.2.4) 

where  F(x)  =  F(x  :  0)  is  the  survival  function  for  the  control  subjects.  The 
corresponding  density  function  is  given  by 

g[x  :tt,A)  =  tt/(x:  A)  +  (1  -  ir)f(x),  0  <  tt  <  1,      0<A<oo.        (2.2.5) 

The  survival  functions  at  (2.2.4)  represent  a  two-parameter  family  of  models  for 
the  two-sample  nonresponders  problem.  In  this  family,  the  claim  that  "there  is  a 
treatment  effect"  can  be  interpreted  as  the  claim  "a  positive  proportion  of  treated 
subjects  are  responders."  Thus,  a  test  of  the  null  hypothesis 

Ho  :  No  treatment  effect, 

against  the  research  hypothesis 

Ha  :  There  is  a  treatment  effect, 

can  be  performed  by  testing 

H0:tt  =  0     vs.     Ha  :  tt  >  0.  (2.2.6) 

An  examination  of  the  nonresponders  model  at  (2.2.4)  shows  that  the  hypotheses 
at  (2.2.6)  are  equivalent  to  the  hypotheses 

H0  :  G(x  :  tt,  A)  =  F(x)     vs.     Ha  :  G(x  :  tt,  A)  >  F(x).  (2.2.7) 
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The  research  hypothesis  at  (2.2.7)  means  that  X\  is  stochastically  larger  than  X0. 
Thus,  any  two-sample  rank  test  for  stochastic  ordering,  such  as  the  Wilcoxon  rank 
sum  test,  will  provide  a  nonparametric  test  for  testing  H0  vs.  Ha.  However,  since 
these  tests  do  not  use  the  specific  form  of  G(x)  under  Ha,  it  is  natural  to  search 
for  tests  that  are  more  efficient  for  testing  for  treatment  effects  under  the  model  at 
(2.2.4). 

For  the  case  where  there  is  no  censoring  of  the  data,  several  authors  (Good, 
1979;  Boos  and  Brownie,  1986;  Johnson  et  al.,  1987;  Conover  and  Salsburg,  1988) 
have  developed  tests  of  H0  assuming  one  of  the  following  forms  for  F(x  :  A): 

(i)  Location  model:  F(x  -  A),      -oo  <  x  <  oo,     A  >  0; 

(ii)        Scale  model:  F(x/eA),      x  >  0,      A  >  0; 

(iii)  Lehmann  model:  [F(x)]e~*  ,      -oo  <  x  <  oo,      A  >  0; 

(iv)  Lehmann  model:  1  -  [1  -  F(x)]eA  ,      -oo  <  x  <  oo,     A  >  0. 

Note  that  F(x  :  A)  is  an  explicit  function  of  F(x)  in  the  last  two  cases. 

In  the  context  of  survival  data,  the  model  for  scale  alternatives  appears  more 
appropriate  because  survival  times  are  nonnegative  random  variables  whose  dis- 
tributions are  often  skewed.  However,  since  the  natural  logarithm  transformation 
T(x)  =  log(x)  transforms  the  scale  alternatives  to  location  alternatives,  rank-based 
tests  for  the  case  of  a  scale  model  are  identical  to  those  for  the  location-shift  model 
under  the  log  transformation.  Accordingly,  in  the  remainder  of  this  work  attention 
is  confined  to  location  models  only. 
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The  models  (iii)  and  (iv)  have  been  explored  by  Conover  and  Salsburg  (1988). 
Their  models  are  expressed  in  terms  of  cumulative  distribution  functions.  Indeed,  if 
we  assume  our  model  (iii),  then  it  is  straightforward  to  show  that  the  corresponding 
cumulative  distribution  function  is 

1  -  G{x  :  x,  A)  =  x  [l  -  [F(x)]e_A]  +  (1  _  x)  [1  -  F{x)\ , 

which  is  their  "Model  2."  It  should  also  be  noted  that  their  parameter  a  is  related 
to  A  through  the  relationship 

a  =  eA,  for  a  >  1. 

Similarly,  under  model  (iv), 

1  -  G(x  :  x,  A)  =  x  [1  -  F(x)}eA  +  (1  -  x)  [1  -  F(x)} 
corresponds  to  the  Conover  and  Salsburg  (1988)  "Model  1." 

A  Graphical  Example 

Consider  the  location-shift  model 

G{x  :  x,  A)  =  xF(x  -  A)  +  (1  -  w)F(x) 

for  (2.2.4).  Values  of  tt,  the  true  proportion  of  responders  in  the  treated  population, 
will  influence  the  shape  of  the  density  in  (2.2.5): 

g(x  :  tt,  A)  =  nf(x  -  A)  +  (1  -  tt)/(x). 

The  shape  of  g{x  :  tt,  A)  will  match  the  shape  of  the  density  f(x)  for  X0  when  tt  =  0 
or  tt  =  1.  The  degree  of  modification  to  the  density  shape  increases  as  the  value  of 
A  tends  away  from  zero  and/or  the  value  of  tt  tends  away  from  either  zero  or  one. 
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For  the  purpose  of  illustration,  let  us  consider  the  case  where  the  random  variable 
Xi\R  =  1  has  the  extreme-value  distribution  with  density 

f{x  :  A)  =  f{x  -  A)  =  exp  (-(*  -  A)  -  e_(—A>)  . 

The  series  of  graphs  comprising  Figures  2.1-2.4  indicate  how  the  values  for  x 
and  A  dictate  right-tail  behavior  for  the  distribution  of  X\.  The  location  shift 
parameter  A  is  labelled  Delta  in  the  following  graphs.  The  parameter  *  is  labelled 
Pi.  When  A  =  0.5,  the  shapes  of  the  distributions  for  Xx  (solid  line  density  for 
the  treatment  population)  and  XQ  (dotted  line  density  for  the  control  population) 
remain  quite  similar  for  the  range  of  values  of  w.  Differences  in  the  densities  only 
become  apparent  for  values  of  7r  >  0.4.  In  comparison,  a  distinction  between  the 
densities  of  X0  and  X\  when  A  =  1  can  be  seen  as  early  as  for  the  value  x  =  0.2. 
The  shape  for  the  density  associated  with  X\  "flattens  out"  at  around  tt  =  0.5  and 
"recovers"  as  it  moves  away  from  0.5.  Figures  2.3-2.4  pronounce  this  phenomenon 
more  clearly,  with  suggestion  of  a  bimodal  shape. 

Figures  2.1-2.4  give  us  an  example  of  how  a  location  shift  parameter  A  and  t 
interact  in  terms  of  their  influence  on  the  density  shape  for  the  treatment  group. 

Random  Censoring 

The  presence  of  censoring  in  survival  data  directly  affects  the  information  in 
samples  from  the  governing  model.  Several  kinds  of  censoring  mechanisms  have 
been  investigated  in  the  statistical  literature,  each  different  in  its  influence  on  con- 
struction of  a  likelihood  for  the  data.  (For  example,  refer  to  Kalbfleisch  and  Prentice 
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Figure  2.1:    Graph  of  extreme- value  mixture  model;  A  =  0.5. 
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Figure  2.3:    Graph  of  extreme-value  mixture  model;  A  =  2.0. 
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(1980,  Ch.  5)  for  a  survey  and  discussion  of  the  different  censoring  types  and  their 
appropriateness.) 

There  are  N  subjects  under  study,  of  which  iV*o  are  in  the  control  group  and 
iVj  are  in  the  treatment  group.  Let  r  =  0  for  the  control  group  and  r  =  1  for  the 
treatment  group.  Associated  with  the  ith  subject  in  the  rth  group  are  two  random 
variables: 

Xri    =    the  potential  failure  time  for  the  ith  subject  in  group  r, 
CTt     =    the  potential  censoring  time  for  the  zth  subject  in  group  r. 

For  the  ith  subject  in  group  r,  we  observe  the  values  of  the  random  variable 

Xri  =  min(Xr,-,Crt) 

and 


6Ti  = 


1 ,       if  Xri  —  XTi  i 


lo,   ifX,  =  cri. 

Thus  6Tl  takes  on  the  value  1  if  the  observation  for  subject  i  in  group  r  is  not 
censored  and  0  otherwise. 

Under  the  random  censorship  model,  we  assume 

(1)  (X0i,Coi),. . .  ,{^oN0,CoN0),{^nXn),---,(XiNl,CiNi),  are  mutually  in- 
dependent pairs. 

(2)  Xri  and  CT,  are  independent  and  continuous,  with  survival  functions 
G{x  :  7r,  A)  and  L(x),  respectively. 

(3)  The  form  of  L(x)  does  not  depend  upon  w  or  A. 
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The  three  assumptions  of  random  censorship  implies  a  noninformative  structure 
about  the  censoring  mechanism.  Most  of  the  results  in  the  statistical  literature  are 
developed  under  these  assumptions.  One  feature  of  the  noninformative  structure  is 
an  opportunity  to  construct  a  partial  likelihood  for  the  observed  data. 

Other  Notations 

When  it  is  necessary  to  use  the  random  variable  associated  with  the  survival 
function  F(x),  the  notation  of  either  F  or  F(X)  will  be  employed  in  this  dissertation. 
This  convention  applies  to  other  random  variables  as  well.  A  (fixed  or  random) 
function  that  takes  on  the  values  F(x)  at  the  point  x  may  be  denoted  by  either 
F(-)  or  F. 

The  notation 

Fr(x)  =  G(x)  ■  L{x)  =  ?r{Xri  >  x) 

is  for  the  survival  function  of  Xr,.  Also, 
denotes  the  hazard  function  at  time  x,  and 


A(x)=  [*  \{s)ds 
Jo 


denotes  the  cumulative  hazard  function  at  time  x.  The  integral  of  a  function  such 

as  X(s)  with  respect  to  the  variation  of  s  over  some  interval  in  9?  (the  real  line)  will 

be  denoted  as  in 

/    X(s)ds. 
Jo 
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This  last  integral  could  also  be  represented  as  a  Lebesgue-Stieltjes  integral  with 
respect  to  the  total  variation  of  A(x),  such  as  in 


/      dA(s). 


Estimates  from  the  data  of  such  quantities  will  be  denoted  using  the  "hat"  notation; 
e.g.  F{x)  is  some  reasonable  estimate  of  F(x). 

Realizations  of  Xrx  are  denoted  by  xri  whether  censoring  occurs  or  not.  Now  sup- 
pose that  we  wish  to  refer  to  all  the  observations  irrespective  of  group  membership. 
Dropping  the  subscript  r  from  xri  produces 

Xi,..  .,xn 

as  the  observed  times  for  the  N  subjects  in  the  combined  sample.  Let  k0  (*j)  denote 
the  number  of  distinct  uncensored  times  in  the  control  (treatment)  sample,  and  set 
k  =  k0  +  ki .  Let 

0  =  i(0)  <  x{1)  <  X(2)  <  ...  <  X(k)  <  x(k+1)  =  oo 

represent  the  ordered  uncensored  times  in  the  combined  sample.  Within  the  interval 
[•£(,), X(,+i)), z  =s  0,  ...,k,  suppose  that  there  are  m,  total  censored  observations 
3(.)i>--  ■^(i)m,-  Define 

{1,     if  X(i)  corresponds  to  a  treated  subject; 
0,     otherwise. 
Thus  z(t)  is  the  group  membership  indicator  corresponding  to  x(i).    The  censored 


values  x{iK,  with  {  =  l,...,m,-,  analogously  have  the  covariates  zim  to  indicate 
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their  group  memberships.  Let 

m, 

Then  Mi  is  the  number  of  censoring  times  belonging  to  the  treatment  group  in  the 
interval  [x(i),x(l+1)).  So  m,  -  M,  is  the  number  of  censored  observations  in  the 
control  group  that  reside  in  [x{i),x{t+1)).  The  quantity 

k 

Ri  =  EK  + 1) 

will  be  called  "the  size  of  the  risk  set  at  time  xtf  and  is  the  number  of  subjects 
whose  values  are  known  to  be  at  least  x{i).  The  quantities  m,-,M„  and  R,  do  not 
have  indices  in  parentheses  since  they  are  examples  of  functions  of  the  x(i).  Note 
that  2(l)  does  not  follow  this  convention. 

2.3     Literature  Review 

Subrahmanian,  Subrahmanian,  and  Messeri  (1975)  noted  that  the  use  of  the 
independent  sample  Student's  t-test  will  suffer  from  reduced  power  when  the  alter- 
native  hypothesis  specifies  a  mixture  of  normal  distributions.  Good  (1979)  appears 
to  be  the  first  to  introduce  the  mixture  model  to  account  for  the  existence  of  non- 
responders  in  the  treatment  group.  To  account  for  the  fact  that  the  presence  of 
nonresponders  will  decrease  the  difference  between  the  means  of  the  two  groups 
and  increase  the  variance  of  the  treatment  group,  Good  (1979)  proposed  a  new 
randomization  test  based  on  the  statistic 


.-.  12 


[*i-A"o]' 
1/(0.67)  =  0-67 l  J    +  (i  _  0.67)(M  -  1)5?,  (2.3.1) 
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where  X0  (X\)  is  the  sample  mean  for  the  control  (treatment)  group,  and  Sf  is  the 
usual  sample  variance  for  the  treatment  group.  Notice  that  the  test  statistic  is  a 
weighted  sum  of  functions  involving  the  difference  in  group  means  and  the  variance 
for  the  treatment  group.  The  choice  of  0.67  by  Good  (1979)  as  the  weight  was  made 
in  hopes  of  wide  applicability  of  the  test  over  the  unknown  range  of  values  for  t 
and  A.  Other  weights  within  the  interval  [0,1]  could  be  used — a  weight  of  1.0  gives 
Fisher's  randomization  t-test. 

The  empirical  power  investigations  of  Good  (1979,  Table  1)  determined  that 
the  test  based  on  i/(0.67)  in  (2.3.1)  performed  markedly  better  over  the  range  of 
values  7r  G  (0,0.8]  than  Student's  t-test  for  sample  sizes  under  ten  when  F(x  :  A) 
represented  the  standard  normal  distribution  and  A  =  1,2,  or  4.  It  also  beat  out 
Fisher's  randomization  t-test  over  the  entire  range  ir  €  (0, 1]  when  the  underlying 
distributions  were  log-normal.  Applications  of  the  test  based  on  ^(0.67)  in  (2.3.1) 
showed  its  evidence  of  increased  "sensitivity"  (Good,  1979)  relative  to  Student's  t 
for  real  data  that  exhibit  the  nonresponders  situation.  Presumably,  the  existence  of 
greater  variability  in  addition  to  a  shift  in  location  for  the  treatment  group  responses 
made  the  test  based  on  (2.3.1)  better  suited  than  the  t-test  for  rejecting  the  null 
hypothesis  of  "no  treatment  effect." 

Boos  and  Brownie  (1986)  addressed  some  interesting  questions  raised  by  Good's 
(1979)  paper.  The  interpretation  of  a  significant  p- value  was  one  issue.  Boos  and 
Brownie  (1986)  emphasized  that  acceptance  of  the  alternative  hypothesis  does  not 
necessarily  imply  a  shift  in  mean.    An  increase  in  the  variability  could  be  solely 
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responsible.  The  graphs  in  Figure  2.4  indicate  how  a  small  7r  value  and  a  large  A 
value  produces  more  variability  in  the  treatment  group  distribution  although  the 
means  of  the  two  groups  are  similar.  In  addition  to  correcting  the  procedure  for 
implementing  a  one-sided  test  with  (2.3.1),  Boos  and  Brownie  (1986)  used  Monte 
Carlo  study  results  to  argue  that  Student's  t-test  and  the  Wilcoxon  rank  sum  test 
are  at  least  as  powerful  as  the  test  based  on  (2.3.1)  when  the  underlying  distributions 
were  normal  or  extreme- value  and  tt  >  0.6.  The  advantage  of  (2.3.1)  appeared  as  A 
increased  and  n  was  less  than  0.6.  Group  sample  sizes  of  eight  were  used.  A  study 
with  group  sample  sizes  of  twenty  did  not  include  (2.3.1);  Boos  and  Brownie  (1986) 
justified  its  omission  with  a  statement  by  Good  (1979)  of  diminished  effectiveness 
of  (2.3.1)  as  NQ  and  N%  grew  larger. 

The  non-technical  article  by  Salsburg  (1986),  which  discussed  several  applica- 
tions of  the  nonresponder  phenomenon,  indicates  that  an  interpretation  of  some 
change  in  the  treatment  group  whether  it  be  primarily  due  to  variability  or  shift  in 
mean  is  useful  in  preliminary  investigations  of  new  clinical  compounds.  The  sample 
sizes  in  such  studies  are  usually  small,  so  that  the  presence  of  nonresponders  will 
decrease  power  of  tests  such  as  Student's  t  and  the  Wilcoxon. 

The  use  of  Lehmann  alternatives  in  the  mixture  model  for  the  treatment  group 
was  proposed  in  a  subsequent  article  authored  by  Conover  and  Salsburg  (1988).  Un- 
like the  articles  of  Good  (1979)  and  Boos  and  Brownie  (1986),  a  formal  mechanism 
to  derive  tests  sensitive  to  the  alternative  hypothesis  was  used.  The  mechanism  was 
Conover's  (1973)  technique  for  deriving  a  locally  most  powerful  rank  test  (LMPRT) 
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under  more  general  models  than  the  usual  location-shift  or  scale  change  settings. 
The  LMPRT  is  a  linear  rank  test  that  is  a  sum  of  expected  or  approximate  scores 
specifically  generated  from  the  given  model  and  assigned  to  the  observations. 

Recall  that  the  mixture  model  (2.2.4)  has  two  parameters,  it  and  A.  Conover  and 
Salsburg  (1988)  took  the  approach  of  fixing  one  of  the  parameters  and  deriving  the 
LMPRT  scores  for  the  other.  In  both  cases,  the  score  functions  were  independent 
of  jr.  Since  both  7r  and  A  indicate  effect  due  to  treatment  in  the  model  at  (2.2.4), 
Conover  and  Salsburg  (1988)  were  interested  in  a  loss  of  power  that  would  arise 
in  using  one  score  function  when  the  other  score  function  was  more  appropriate. 
A  comparison  of  the  two  score  functions  in  terms  of  Pitman  asymptotic  relative 
efficiency  (ARE)  lent  creedence  to  a  compromise  score  function  based  on  a  value  of 
A  that  maximized  the  Pitman  ARE. 

Test  statistics  based  on  the  scores  developed  by  Conover  and  Salsburg  (1988) 
were  shown  to  be  asymptotically  normal  in  distribution.  For  smaller  samples, 
Conover  and  Salsburg  (1988)  suggest  the  use  of  the  two-sample  Student's  t-test 
computed  on  the  scores.  Monte  Carlo  studies  indicated  that  the  empirical  signif- 
icance levels  for  the  tests  were  close  to  the  chosen  nominal  significance  level  for 
sample  sizes  as  few  as  five  in  each  of  the  groups  when  the  t-test  approximation  was 
employed.  Applying  the  tests  on  two  examples  of  real  data  indicates  better  perfor- 
mance than  Student's  t,  the  Wilcoxon,  and  Good's  test  based  on  (2.3.1)  in  the  sense 
of  calculated  p- values.  One  example,  which  compares  pain  values  for  control  and 
treated  patients  undergoing  acute  painful  diabetic  neuropathy  had  sample  sizes  of 
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N0  =  N\  =  10.  The  other  example  of  SGOT  liver  function  values  for  heart  patients 
had  samples  sizes  of  A^  =  28  and  N%  =  30. 

A  similar  formal  approach  of  ranks  was  taken  by  Johnson  et  al.  (1987),  where  a 
location-shift  was  postulated  for  F(x  :  A).  Johnson  et  al.  (1987)  clearly  stated  the 
testing  problem  to  be  the  one  considered  in  this  manuscript  at  (2.2.6): 

Ho  :   7r  =  0  vs.    H a  :   w  >  0. 

The  linear  rank  tests  derived  by  Johnson  et  al.  (1987)  were  noted  to  be  based  on 
score  functions  that  accentuated  large  responses,  which  is  contrary  to  the  usual 
aspect  of  score  functions  for  rank  tests  such  as  the  Wilcoxon.  The  example  in 
Chapter  One  of  this  manuscript  that  referred  to  higher  SCE  counts  for  patients 
receiving  chemotherapy  served  as  an  illustration  of  the  usefulness  of  the  tests  derived 
by  Johnson  et  al.  (1987).  In  asymptotic  relative  efficiencies  and  Monte  Carlo  studies, 
the  derived  tests  performed  better  than  standard  tests  such  as  the  Wilcoxon  for  a 
broad  range  of  configurations  that  reflect  the  nonresponders  problem. 

Two  underlying  forms  for  F(x  :  A)  were  considered — uniform  and  normal.  The 
normal  distribution  with  a  location  shift  generated  a  score  function  that  was  de- 
pendent on  A.  The  resulting  linear  rank  test  could  then  be  used  to  get  an  exact 
conditional  distribution  under  the  null  hypothesis.  As  the  sample  sizes  No  and 
Ni  increase,  computations  become  prohibitive  and  an  asymptotic  distribution  is 
convenient.    Also,  the  score  functions  may  not  have  tractable  expectations  so  that 
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approximate  scores  that  are  asymptotically  equivalent  are  useful.  Results  for  dealing 
with  these  problems  and  others  are  systematically  dealt  with  in  the  Johnson  et 
al.  (1987)  article.  The  following  subsection  summarizes  their  findings. 


Properties  of  Uncensored  Data  Linear  Rank  Tests 

Under  the  mixture  model  at  (2.2.4),  the  score  function  assigned  to  Xt{\  is  <?i>(ti,), 
where 

u,  =  1  -  F(x(i)  :  0)  =  1  -  F(x{i)), 


and 


n  ]      f(F-m-u)) 


(2.3.2) 


f(F-*(l-u)) 
The  function  (2.3.2)  will  be  subsequently  designated  the  "mixed-model"  score  func- 


tion. 


The  form  of  the  statistic  for  the  LMPRT  for  testing  H0  :  tt  =  0  vs.  Ha  :  tt  >  0  is 


N 


r=s  23  *<•■)£  (#"•■))  i 


(2.3.3) 


t=i 


with  the  expectation  £  taken  over  the  joint  density  of  the  order  statistics  of  a 
random  sample  of  size  N  from  a  uniform  (0,1)  distribution.  The  fact  that  the  ith 
order  statistic  from  a  uniform  (0,1)  distribution  has  a  beta  distribution  with  density 


AM 


{i  -  1)1  (N-i)\ 


u 


j-,(l-ti,)<**1-0-»J         0  <«,<!, 


can  be  utilized  to  evaluate  T  under  specific  forms  for  <j>  such  as  when  the  underlying 
distributions  F(x  —  A),      A  >  0  are  normal. 
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Exact  significance  levels  associated  with  T  at  (2.3.3)  can  then  be  calculated  via 
the  permutation  principle.  The  score  function  (2.3.2)  can  also  be  used  with  the 
approximation 

£(<f>(Ui))  «  <!>[£(*)]  =  <f> 


N  +  l\ 


to  get  a  statistic 


r'=|><.*GvTr)  (2-3-4) 

that  is  asymptotically  equivalent  to  T  at  (2.3.3).  The  approximate  scores  not  only 
have  easier  computational  forms  but  also  possess  closed-form  expressions  for  most 
of  the  common  distributions. 

Since  computations  with  all  possible  permutations  can  quickly  become  too  in- 
tensive as  N  increases,  asymptotic  results  are  employed  for  assessing  significance 
levels  when  jV  is  large.  Under  mild  regularity  conditions,  standardized  versions  of 
both  (2.3.3)  and  (2.3.4)  will  be  asymptotically  normally  distributed  under  the  null 
hypothesis. 

THEOREM  2.3.1   Assume  that 

(i)  <f>(u)  as  defined  by  (2.3.2)  is  monotonic. 

l 

(ii)  f  <t>2(u)du  <  oo. 
o 

If  v  is  either  (2.3.3)  or  (2.3.4),  then  under  the  null  hypothesis  H0  :  r  =  0, 

v 
yVar(i/)' 
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where 

Var(.)  =  -Mj-  f  <?{u)du, 

I\0  +  iVi  Jo 

converges  in  distribution  to  a  standard  normal  random  variable  as  mm(No,Ni)  —fr- 
ee. 

Proof.   See  Johnson  et  al.  (1987),  Theorem  3.1.  □ 

Behavior  of  the  statistics  T  at  (2.3.3)  and  T*  at  (2.3.4)  under  alternative  hy- 
potheses has  been  handled  in  the  context  of  sequences  of  local  parameter  values. 
Johnson  et  al.  (1987)  provide  formal  details  in  the  appendix  of  their  article.  The 
distributions  of  T  and  T*  under  sequences  of  local  alternative  hypotheses  are  asymp- 
totically normal  and  can  be  used  to  obtain  efficiency  expressions.  As  mentioned 
before  in  review  of  the  Conover  and  Salsburg  (1988)  article,  the  Pitman  ARE  can 
be  used  to  help  determine  a  value  for  A  in  order  to  conduct  the  test.  Let  a  >  0  be 
some  constant.  Under  a  limiting  sequence  of  local  alternatives 

^  =  1=,         min(Ar0,Ari)  ^  oo,  (2.3.5) 

the  Pitman  ARE  can  be  computed  to  explore  loss  of  efficiency  for  a  particular  LM- 
PRT  due  to  inaccurate  specifications  such  as  the  value  of  A.  Since  the  test  statistics 
depend  on  A,  a  choice  of  some  value  to  substitute  for  a  unknown  parameter  is  an 
important  issue.  This  section  introduces  some  evidence  of  the  problem.  Further 
study  in  hopes  of  finding  adequate  solutions  is  discussed  in  later  chapters,  after  the 
censored  data  case  is  considered. 


Suppose  <f>ch(u)  denotes  the  chosen  score  function  to  be  used,  while  <f>t{u)  denotes 
the  score  function  which  contains  the  true  value  of  A.  That  is,  let 

/(F-*(l-u)-Ac,) 


and 


/(F-»(l-«)) 


/(F-*(l-u)-AQ 


/(F-»(l-u)) 

where  Ac^  is  the  chosen  value  of  A,  and  At  is  the  true  value  of  A. 

THEOREM  2.3.2    Under  the  conditions  of  Theorem  2.3.1  and  (2.3.5),  the  Pitman 
ARE  of  the  test  statistic  v  based  on  <j>ch(u)  relative  to  <pt(u)  is 


eff(AcA,At)  = 


/ 4>ch(u)(j)t(u)du 

0 


f(f>lh{u)du 


j $(u)du 

0 


Proof.   See  Johnson  et.  al  (1987),  Theorem  3.2.  □ 

The  computation  of  a  Pitman  ARE  may  be  simplified  with  the  next  result. 

LEMMA  2.3.1   For  the  mixed-model  uncensored  score  function  <j){u), 

i 

[  </>(u)du  =  0, 
o 

or  equivalently, 


/ 


/(F-'(l-u)-A) 

/(F-i(l-u)) 


du  =  1. 


Proof.   The  integration  substitution  x  =  F_1(l  —  u)  produces 


J  <j>{u)du    =     J  /(^f)/( 


/(*) 


=     1  -  1  =  0.  □ 
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Cross  product  terms  that  arise  when  components  of  the  Pitman  ARE  are  ex- 
panded out  will  involve  the  same  type  of  integral  as  given  in  Lemma  2.3.1,  thereby 
leading  to  simplified  evaluations. 


An  Example 

Theorem  2.3.2  can  be  used  to  evaluate  a  (Pitman)  ARE  for  some  particular 
distributions  when  the  unknown  parameter  is  misspecified.  Any  mention  of  ARE 
in  this  dissertation  implicitly  refers  to  Pitman  asymptotic  relative  efficiency. 

The  ARE  expression  in  the  theorem  can  also  be  employed  to  compare  the  mixed- 
model  scores  test  with  the  linear  rank  test  derived  under  the  usual  location-shift 
model.  Score  functions  such  as  those  for  the  Wilcoxon  and  log-rank  take  the  place 
of  4>ch{u).  The  mixed-model  scores  are  substituted  into  <j>t(u). 

An  illustration  using  the  normal  distribution  for  F(x  —  A)  is  now  presented. 
Recall  that  Johnson  et  al.  (1987)  chose  the  normal  distribution  to  derive  a  mixed- 
model  score  function.  First  we  note  that  because  <f>{u)  does  not  have  a  closed-form, 
the  expected  score  corresponding  to  X(,-j  must  be  numerically  evaluated.  A  general 
expression  for  the  expected  score  is 

AH  /• 

« =  (,-  - 1).  {N  -  ,yl«<*MrQ  -  «)'*»-»■'*.■ 

Now  refer  to  Theorem  2.3.1.  Since  all  components  of  the  first  derivative 


*'(«)  = 


AeAF-i(l-«) 

/(F-i(l-u)) 


exceed  0,  condition  (i)  holds.    Also,  condition  (ii)  holds  for  any  A  >  0,  since  by 
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Table  2.1:     Pitman  ARE  for  misspecified  A,  mixed  normal  distribution 


A, 


Ac/, 

0.5 

1.0 

1.5 

2.0 

2.5 

3.0 

0.5 

1.00 

.860 

.518 

.194 

.042 

.009 

1.0 

.860 

1.00 

.831 

.443 

.141 

.026 

1.5 

.518 

.831 

1.00 

.801 

.393 

.115 

2.0 

.194 

.443 

.801 

1.00 

.784 

.732 

2.5 

.042 

.141 

.393 

.784 

1.00 

.780 

3.0 

.009 

.026 

.115 

.372 

.780 

1.00 

straightforward  integration, 


/  4>2{u)du  = 


e*  -  1. 


This  provides  the  asymptotic  properties  of  v  when  the  underlying  survival  distribu- 
tion is  mixed  normal. 

For  studying  misspecification  of  A,  Table  2.1  is  an  extension  of  Table  3  in  John- 
son et.  al  (1987).  Again  by  straightforward  integration,  the  algebraic  form  of  the 
ARE  for  misspecified  A  is 

eff(Ac,,  A,)  =  -L-f- TT_ J   . 

Now  we  turn  to  model  misspecification.  Table  2.2  is  an  extension  of  Table  3  in 
Johnson  et  al.  (1987).  In  comparison  to  the  normal  scores  tests  of  Fisher  &  Yates 
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Table  2.2:     Pitman  ARE  for  location-model  vs.  mixed-model  linear  rank  test, 
mixed  normal  distribution 


A  for  mixed-model  score 

0.5 

1.0         1.5         2.0         2.5 

3.0 

0.880 

0.582     0.265     0.075     0.012 

0.001 

(1963)  and  van  der  Waerden  (1953),  Table  2.2  indicates  that  the  mixed-model  scores 
have  a  distinct  advantage  for  A  >  1.0.  This  was  pointed  out  in  Table  1  of  Johnson  et 
al.  (1987).  The  SCE  data  example  in  Johnson  et  al.  (1987)  makes  the  only  mention 
by  the  authors  on  a  choice  for  A  in  practice,  which  was  A  =  1.  A  look  at  the  results 
of  their  Monte  Carlo  study  suggests  that  the  mixed-model  test  when  the  underlying 
distributions  are  normal  has  better  power  than  the  standard  location-shift  normal 
scores  test  or  the  Wilcoxon  test.  This  was  particularly  true  for  true  values  of  A  =  2 
or  A  =  3  and  samples  sizes  of  10,  20,  and  40  per  group.  Choosing  A  =  0.5  or  1.0  or 
1.5  or  2.0  to  calculate  the  test  statistic  gave  very  similar  empirical  powers  in  most 
cases. 


Discussion 


The  articles  that  we  have  reviewed  in  the  literature  have  dealt  with  rank-based 
methods  for  the  nonresponders  problem  with  complete  observations.  A  believed 
advantage  of  deriving  scores  based  on  ranks  is  their  robustness  to  departures  from 
the  assumed  underlying  distributions  F(x  :  A).    Additionally,  the  invariance  to 
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monotone  transformations  is  a  worthwhile  feature.  The  property  of  locally  most 
powerful  rank  tests  is  appealing  as  well,  since  it  is  usually  the  case  that  such  rank 
tests  do  well  for  nonlocal  alternatives.  Extensions  of  these  procedures  constitute 
the  thrust  of  the  remainder  of  this  manuscript.  One  topic  is  the  use  of  other  dis- 
tributional forms  for  F(x  :  A)  other  than  the  normal.  The  main  interest,  however, 
is  to  develop  scores  and  linear  rank  tests  that  allow  for  the  potential  of  censored 
observations.  One  way  to  derive  such  scores  is  from  the  so-called  rank  likelihood. 
Once  such  scores  are  generated,  distributional  properties  need  to  be  considered  for 
inferential  purposes. 

The  issue  of  how  to  choose  A  to  conduct  the  test  in  practice  demands  more 
attention  than  what  has  been  given  in  the  literature.  To  this  end,  more  study  is 
warranted  on  the  behavior  of  the  tests  as  different  values  of  A  are  chosen. 


CHAPTER  3 
METHODS  FOR  CENSORED  DATA 


3.1     Introduction 

Investigation  of  the  properties  associated  with  a  linear  rank  test  statistic  for 
censored  data  requires  a  different  approach  than  that  for  the  case  where  no  censoring 
occurs.  A  useful  technique  in  similar  and  common  inference  situations  is  to  cast  the 
test  statistic  as  a  stochastic  integral.  This  enables  the  use  of  the  theory  of  counting 
processes  and  martingales  to  derive  asymptotic  properties.  Aalen  (1978)  and  Gill 
(1980)  were  responsible  for  much  of  the  foundational  work  of  this  approach.  Their 
findings  have  been  applied  numerous  times  by  researchers  in  the  area  of  censored 
data  linear  rank  statistics. 

In  Section  Two  of  this  chapter,  a  rank-based  likelihood  is  derived  for  censored 
data  under  the  mixture  model  (2.2.4).  The  likelihood  is  used  to  determine  the  form 
of  the  linear  rank  statistic,  //,  for  testing  H0  :  n  =  0.  The  equivalent  expression 
of  v  as  a  stochastic  integral  is  also  derived.  Section  Three  explores  asymptotic 
equivalence  of  linear  rank  statistics  when  v  employs  certain  scores  that  approximate 
the  expected  scores.  Also  introduced  are  some  preliminary  results  that  are  needed 
to  establish  the  asymptotic  normality  of  a  properly  standardized  version  of  v.  The 
asymptotic  properties  under  the  null  hypothesis  make  up  the  content  of  Section 
Four.   The  results  and  discussion  in  Section  Five  focus  upon  score  expressions  for 
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v  that  may  be  useful  in  practical  applications.    Properties  of  v  under  alternat 
hypotheses  are  presented  in  Section  Six. 


lve 


3.2     The  Linear  Rank  Test  Statistic  v 

Rank  Likelihood  for  the  Data 

Kalbfieisch  and  Prentice  (1973)  denned  a  censored  data  rank  statistic  as  the  set 
of  all  rank  vectors  of  uncensored  survival  times  that  can  give  rise  to  the  observed 
ranks  of  the  data.  The  probability  of  the  observed  censored  data  rank  statistic, 
P(r),  can  be  calculated  by  summing  up  the  probabilities  of  all  possible  underlying 
rank  vectors  given  the  censoring  pattern  of  the  data.  All  possible  rankings  of  the 
sample  that  might  be  observed  if  we  could  measure  the  actual  survival  times  for 
each  subject  are  added  together  to  get  P(r). 

The  Kalbfieisch  and  Prentice  (1973)  formulation  does  not  take  into  account  the 
order  of  the  censored  observations  inside  an  interval  defined  by  two  adjacent  ordered 
uncensored  values.  Sacrifice  of  this  ordering  information  should  be  negligible  in  the 
relative  presence  of  a  decent  number  of  uncensored  observations  (Prentice,  1978). 
Recall  the  mixture  model  at  (2.2.4)  as 

G{x  :  7T,  A)  =  G(x)  =  *F(x  :  A)  +  (1  -  ir)F(x), 

with  A  >  0  and  7T  G  [0, 1].  A  representation  of  P(r)  for  data  sampled  under  model 
(2.2.4)  and  the  random  censorship  structure  is 


35 


p(r)  =        /•  •  ■  /       II  /(»(0)1"<!01*/(*W  :  A)  +  (1  "  *)/(*«  :  0))Z(,) 


■E(l)<1(2)<—  <*(*) 

<  [F(xm  :  0)]m'-M'[7rF(x(,)  :  A)  +  (1  -  x)F(x{i)  :  0)]*  dx{i).      (3.2.1) 

Because  the  censoring  distributions  are  assumed  to  not  depend  on  ir  and  A,  the 
censoring  distributions  are  not  included  in  the  likelihood  expression  at  (3.2.1).  This 
"rank  likelihood"  allows  generation  of  scores  in  linear  rank  statistics  for  various 
choices  of  F(x  :  A),  A  >  0.  A  mechanism  for  deriving  linear  rank  tests  from 
P(r)  parallels  the  technique  for  constructing  locally  most  powerful  rank  tests  with 
uncensored  data  for  the  usual  types  of  alternatives — location,  scale,  and  regression 
(e.g.  Hajek  and  Sidak,  1967). 

A  General  Form 

There  are  three  conditions  of  Hajek  and  Sidak  (1967,  p.  70)  that  are  sufficient  for 
deriving  locally  most  powerful  rank  tests.  A  version  of  these  conditions  appropriate 
for  the  purposes  of  this  dissertation  is: 

(i)  For  some  t   >  0,  the  survival  function  G(x   :  ir,A)  and  the  density 
g(x  :  jt,  A)  are  continuous  in  7r  :  w  g  [0,  e]  for  every  x. 


(ii)  The  limits 
dG{x:n,&) 


dz 


^HmG(x:jr,A)-G(x:0,A) 


7T=0 


ir— 0 


=  lim 


G(x:tt,A)-F(x) 


*■— o 


and 
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dg(x  :  7r,  A) 


JT=0 


dir 
exist  for  every  x. 


=  lim  9{x:x,A)-g(x:0,A)  =         g(x  :  x,  A)  -  /(x) 


(Hi) 


oo 

/ 


dg(x  :  ir,  A) 


dn 


dx  <  oo  for  every  A 


and 


J 


dG(x  :  7T,A) 


dn 


f(x) 

F(x) 


dx  <  oo  for  every  A. 


We  now  introduce  the  general  form  of  v  in  the  following  theorem. 
THEOREM  3.2.1    Consider  the  two-sample  data  situation  given  by  (2.2.4): 
G(x)  =  G(x  :  7T,  A)  =  *F{x  :  A)  +  (1  -  v)F(x). 

Suppose  the  support  of  F(x)  is  contained  within  that  of  F(x  :  A),  and  there  exists 
a  positive  constant  B  <  oo  such  that 


/ 


F(x  :  A) 
F(x) 


f{x)dx  =  B. 


(3.2.2) 


Then  a  rank  test  for  testing 


HQ  :   ■k  =  0    vs.    Ha:   tt  >  0 


in  £/ie  model  (2.2.4)  can  be  based  on  the  statistic 


K 


»=1 


(3.2.3) 
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wh 


ere 


and 


Cl  =  j.--j  4>{Ui)  n{^(i  -  Uj)m>  dUj), 


Ct=  J...J*(ut)Y[{RJ(l-U}r>duj}, 


(3.2.4) 


(3.2.5) 


Ul<  —  <Ufc 


.7=1 


with 


and 


9[  }         f{F-\l-u)) 


*(tt) = F{ri(]~u):A)  - 1 = f{f~1{i  ~ u) :  A) 


-i. 


F(f-»(l-u))         "  (1-u) 

Proo/.    Following  Johnson  et  al.  (1987,  p.  654),  a  proof  of  Theorem  3.2.1  can  be 

given  by  verifying  the  three  conditions  on  page  35  of  this  manuscript.    For  some 

e  >  0,  (i)  the  survival  function  G(x  :  w,  A)  =  (1  -  tt)F(x)  +  ttF(x  :  A)  and  the 

density  g(x  :  x,  A)  =  (1  -  n)f(x)  +  wf(x  :  A)  will  be  continuous  in  n  :  t  e  [0,e] 

for  every  x.  The  one-sided  nature  of  the  interval  [0,  e]  does  not  affect  the  validity  of 

Hajek  and  Sidak's  conditions  according  to  the  argument  of  Johnson  et  al.  (1987). 

Next,  (ii)  the  limits 

,.      G(x:tt,A)-G(x:0,A)           ,.      G(x)  -  Fix) 
urn - -    =     hm — — — - 

=     lim(l-w)F{x)  +  TF(x:*)-F(x) 

7T— 0  W 

=    F{x:A)-F{x) 


and 


lim 

*■— o 


g(x:ir,A)-g(x:0,A) 


Iim^^/(X) 


7T ->0 


IT 
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=     lim(l-^)/(»)  +  ^/(»:A)-/(x) 


T-fO 


TV 


=    f(x:A)-f(x) 


exist  for  every  x;  and  (iii) 


+00 

/ 


\G{x:tc,A)-G(x:0,A)\ 
G(x:0,A) 

\F(x  :  A)  -  F(x)\ 
F(x) 

F(x  :  A) 


+00 

/ 

-00 
+00 


g{x  :  0,A)dx 


f(x)dx 


< 


■00  -00 

+I°F(X  :  A) 


F{x) 


=  B  +  l  <  00 


■f(x)dx  +  l 


when  the  supposition  at  (3.2.2)  holds,  and 


J  \g(x:7c,A)-g(x:0,A)\dx 

-00 

+00 

=  J  \f(x:A)-f(x)\dx 

—00 
+00  +00 

<  J  \f(x:A)\dx+  J  \f(x)\dx 


-00  -00 

=  1  +  1  =2  <  00. 


To  derive  the  linear  rank  test  statistic  based  on  (3.2.1),  refer  to  Prentice  (1978)  as 
a  basis  for  the  following  representations.  Conditions  (i)-(iii)  being  satisfied  ensures 
that  differentiation  and  integration  interchange  is  permissible.  The  rank  probability 
(3.2.1)  can  be  expressed  as 


P(r)=         J...  J        f[PidxW, 


:r0)<S(2)<—<*(k) 
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where  P{  represents  the  contribution  to  the  likelihood  of  failure  time  x^j  and  the 
censored  observations  a?(,-)i, . . .  ,i(,-)mj  contained  in  the  interval  [x(,),X(,+1)): 

P{  =  /(x^-^fix^  :  A)  +  (1  -  w)f{x(i)  :  0)]™ 

x  [F(x(t)  :  0)]m'-A/'[7rF(x(t)  :  A)  +  (1  -  K)F(x{t)  :  0)]M'  dx{i). 


Then 


dlog  P{r)  dP(r) 


On 


1 
P(r) 

1 
P(r) 


P(r) 


0 

d 


-J--jY[p,dX{i) 


:=1 

d   ' 


P(r)  I 


k    3 

,=1  dT 


l[Ptdx(i 
]JPidx{i 


•) 


(3.2.6) 


Evaluation  of  the  right-hand  side  of  (3.2.6)  at  the  null  value  ir  =  0  provides  the 
form  of  the  linear  rank  statistic  given  by  (3.2.3).  To  illustrate  this,  first  note  that 


P(r) 


=  /•••/  n/(*(o)^(*(.))m,^(.)< 

n.^*: ...£t,.  >  <  =  1 


which  can  be  directly  integrated  (Prentice,  1978)  to  arrive  at  the  result 


P(r) 


k    1 

r=0  «=1  **• 


(3.2.7) 


Next,  observe  that 
h    d 


£id* 


K=0 


d 


£  oT  {(1  "  *(.))  lQg /(*(0)  +  K  "  M')  loS  JX*©) 

1=1  a7r 
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-EN 


+2(1)  log[7r/(x(l)  :  A)  +  (1  -  Jr)/(x(0)] 

+  M,-log[7rF(x(0  :  A)  +  (1  -  tt)F(x(0)]} 
/(x(l)  :  A)  -  f(x{i)) 


7T=0 


1  =  1 


)[7r/(x(i):A)  +  (l-7r)/(x(,))] 

,[xF(x(,):A)  +  (l-7r)F(x(t))] 


ir=0 


*    f      fa,  :  A)  -  /(xH)  F(X(,,  :  A)  -  F(x(l)) 

£  rw — 7m — + M{ 

7(*(.)  :  A)        ' 


-EN 


i=l 


/(*«) 


+  M, 


F(x(t)  :  A) 


L  ^(*co) 


-i 


(3.2.8) 


Putting  (3.2.6)-(3.2.8)  together  and  then  using  the  substitution  Uj  =  1  —  F(x^) 
yields  the  linear  rank  test  statistic  given  by  (3.2.3)  with  scores  (3.2.4)  and  (3.2.5). 
D 

The  score  functions  (3.2.3)  were  reported  in  Johnson  et  al.  (1987)  for  the  situa- 
tion of  uncensored  data  and  location-shift  form  for  .F(x  :  A).  Conover  and  Salsburg 
(1988)  derived  specific  scores  corresponding  to  a  Lehmann-alternative  choice  of 
F(x  :  A).  The  next  corollary  summarizes  the  notion  that  the  resulting  linear  rank 
statistics  provide  locally  most  powerful  rank  tests  in  the  uncensored  data  case. 

COROLLARY  3.2.1  Under  the  conditions  of  Theorem  3.2.1,  suppose  that  there  are 
no  censored  observations.  The  locally  most  powerful  rank  test  (LMPRT)  is  given  by 
the  form  (3.2.3)  with  the  scores  (3.2.4). 

Proof.  Set  Mi  =  0  in  (3.2.3).  This  gives  (2.3.3),  the  LMPRT  as  discussed  in  Section 
2.3.  □ 

The  following  lemma  contains  a  sufficient  condition  for  (3.2.2)  in  Theorem  3.2.1 
when  F(x  :  A)  =  F(x  -  A). 
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LEMMA  3.2.1   Let  B  be  a  positive  constant  such  that  B  <  oo,  and  let  <j>(-)  be  defined 
as  in  (2.3.2)  or  (3.2.4).  If  F(x  :  A)  =  F(x  -  A),  then 


1  1 

f<t>2(u)du=   I 


/(F-i(l_u)_A) 
/(f-i(l-ti)) 


du  <  5/2 


implies  that 


oo 

/ 


F(x-A) 
F(x) 


f(x)dx  <  B. 


Proof.     Make  the  substitution  u  =  1  —  F(x)  and  integrate-by-parts  to  show 


/  ^*>*  ■  / 


JF(F-1(l-u)-A) 


Ju 


1-u 

0 

=    [-\og(l-u)F(F-\l-u)-A)]l 

f(F-*(l  -  «)  -  A) 


+ 


/log(l-u)' 


cfo.  (3.2.9) 


/(.F-Hl-ti)) 

Now  the  first  term  on  the  right-hand  side  of  (3.2.9)  contains  the  form  oo  •  0  when 
evaluated  at  u  =  1  and  thus  requires  some  extra  handling  to  show  that  its  value  is 
zero.  Let  X^j  denote  the  indicator  function  which  takes  on  the  value  1  when  the 
event  A  occurs  and  0  otherwise.  Let  X  be  the  random  variable  with  the  survival 
function  F.  The  substitution  x  =  F-1(l  —  u)  leads  to 


urn 

u—1 


log(l  -  u)F(F_1(l  -  u)  -  A)]     =    Jim  -  \og(F(x))F(x  -  A) 

=    Jim  -  log(F(x))^J[Ar>r_A] 
<     l\m£[-log(F(X)))l[x>x_A] 

(for  —  log(F)  is  monotone  increasing) 


=    }™S[-\og(F(X))} 


42 


-  \im£[-\og(F(X))}l[x<x.A] 
=    1-1=0 

by  the  Monotone  Convergence  Theorem  and  the  fact  that 

1 
£[-log(F(X))]  =  -J\og(l-u)du=l. 
o 

The  second  term  on  the  right-hand  side  of  (3.2.9)  is  also  finite  since 

■  i 

J  log(l  -  u)- 


%u 


/(F-i(l-u)) 

f(F-\l-u)-A) 


<  (/[log(l-u)]2^)  (I 
<2-{B/2)  <  oo 


f(F-^l-u)) 


du 


by  the  Cauchy-Schwarz  inequality  and  by  the  sufficient  condition  of  the  lemma.  □ 
The  condition  in  Lemma  3.2.1  is  closely  related  to  large-sample  properties  for 
the  linear  rank  statistic  v.  As  can  be  seen  from  Theorem  2.3.1,  it  is  the  primary 
component  of  an  expression  for  the  asymptotic  variance  of  v  with  uncensored  data, 
which  is  required  to  be  finite. 

Alternate  Summation  Form  for  v 

It  has  been  common  and  convenient  in  the  literature  to  display  linear  rank  statis- 
tics for  testing  equality  of  survival  distributions  in  the  form  of  a  sum  of  weighted 
differences  between  the  number  of  observed  deaths  and  the  number  of  conditionally 
"expected"  deaths.  Conditioning  is  done  on  the  prior  failure  and  censoring  history 
observed.  Tarone  and  Ware  (1977)  and  Prentice  and  Marek  (1979)  formally  discuss 
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this  form  for  some  of  the  more  well-known  test  statistics.  Authors  such  as  Harring- 
ton and  Fleming  (1982)  consider  whole  classes  of  statistics  that  are  distinguished 
by  the  choice  of  the  weights  on  the  differences. 

Recall  the  definition  Ri  of  the  size  of  the  risk  set  in  Section  2.2.  Let  Ru  be 
the  number  of  treated  subjects  in  the  risk  set  at  failure  time  x/;j  and  i?o,-  be  the 
corresponding  number  in  the  control  group.  Note  that  Roi  +  Ru  =  Ri-  The  present 
goal  is  to  represent  u  (defined  in  (3.2.3)-(3.2.5))  in  the  following  form: 


w 


here 


g^fco-ijf). 


tu,  =  ct  —  C, 


(3.2.10) 


(3.2.11) 


is  the  weight  function  associated  with  the  failure  time  xu\. 

A  sufficient  condition  for  v  to  have  the  form  at  (3.2.10)  and  (3.2.11)  is  that  the 
scores  C{  and  C,  satisfy 


Rid-1  =  a ;  +  (Rt  -  l)d\    i  =  i,...,k, 


(3.2.12) 


where  Co  —  0.  The  condition  at  (3.2.12)  was  first  derived  by  Prentice  and  Marek 
(1979),  who  used  it  to  represent  the  log-rank  (Mantel  1966;  Cox  1972),  Gehan 
(1965),  and  Peto  &,  Peto  (1972)  statistics  in  the  form  (3.2.10).  Since  v  can  be 
rewritten  (Prentice  and  Marek,  1979)  as  the  sum  of  the  form  (3.2.10)  and  a  quantity: 


v  =  £ m  (*<o  -  tt)  +  E *(0  (a  -  Ci  -  *((&.,  -  Ct)) , 


(3.2.13) 


where  the  second  term  on  the  right-hand  side  of  (3.2.13)  equals  zero  when  (3.2.12) 
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holds,  we  concentrate  on  showing  that  the  relation  (3.2.12)  indeed  holds  for  the 
mixed-model  scores  c,  and  C,. 

In  full  notation,  (3.2.12)  is  expressed  as 


k 


x  H  J2,-(l  -  uj)mi  duj. 


Expanding  the  above  full  version  of  (3.2.12)  yields 

'/../.>  /iA.-.,» '-')n «."-.!-■*., 

x  n^(!  -Uj)m'duj, 

j=i 

Cancelling  common  terms  on  both  sides  and  simplifying  gives  for  (3.2.12): 

*  II  3*(1  -  «i)m' Ay.  (3.2.14) 
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Assume  that  Ri  >  1.  We  adapt  a  technique  employed  by  Mehrotra,  Michalek, 
and  Mihalko  (1982).  Starting  with  the  second  term  on  the  right-hand  side  of 
(3.2.14),  first  integrate  out  ut+1, . . .  ,Uk  to  get 

r\  F{F-\\-Ui):  A) 


(tf.-l)/1     ■••/1 

Vu,_,  Jo 


F(F~i(l-Ui)) 


■Riil-ui)*-1 


i-l 


x  l[{Rj{l  -  uj)m*  duAdw 

3-1 

(Rt-i)f  ■  ■  ■  f1  niRAi  -  Ujr>  dUj] 

Jui-2  J0     -.J 

-1     F(F~\1  -  u{)  :  A) 


Jui-i 


(1  -  Ui) 


/2,-(l-u,-)       d"i-        (3.2.15) 


Next,  apply  the  integration-by-parts  technique  to  the  integral  over  u,-.  Let 


U    =     F(F-\1  -«,).;  A),  and 


dV    =    Riil-Uif'^dm. 


By  the  sequence  of  relations 


it  follows  that 


V    = 
F(y)    = 

f{y)dy   = 
dy 

dui 


F-l(l-Ui), 

1  -  Ui, 

diii, 

1 


/(</)' 


dU_ 

dux 


—    =    -/(*-*(! -«,):  A) 


dF-\l  -  m) 


dui 


=    -/(F"l(l  -  «,) :  A) * 
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and 


V     = 


ft 
Ri-1 


(l-u,f>-\ 


Use  of  these  relations  in  the  last  integral  of  (3.2.15)  gives 


f1     F(F~l{l  -  Ui)  :  A)ft(l  -  Ui)R'-2dui 

Ju,-i 

=  [^  F(F-\l  -  «,)  :  A)d  [-^y(1  -  ^ 

■FiF-'il-u^-.AXl-uif'-1 


\R,-i 


Ri 


+ 


Ri-1 

i     f(F-l(l-u,):A)     Ri 


«i-i 


/' 


f(F-*(l-Ui))     Ri-1 


(1  -  u,)"*-1^ 


Ri 


=  -^"ZriO  -  F{F~l(l  -  «,-i)  :  A)(l  -  U,_1)H'-1} 
+  ( ft_A  r       /(f'ft  -  uj)  :  A) 

ft 


(1  -  u,)*'1^ 


ft     /i      /(F-'(i  -  «,) ;  A) 

-RT-lLt  -     /(F->(1 -«•■))     (    ~~     }  ''"  (  } 

Then  with  the  additional  relation  implied  by  the  definition  of  i?j  in  Section  2.2: 

m;  =  Rj  -  i?_,_!  -  1,         ;  =  1, . . . ,  k, 

substituted  into  (3.2.16),  we  get 


(ft-l)(C<  +  l) 

=(ft-i)/1  ■••fnw-i,^} 

X  R~~[F{F~1{1  ~  U*'-l)  :  A)(1  "  U,'-l)fl'_1 

-(ft-i)/1  -fi¥-«ir*K) 
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=  R,[l  •••  fn{^(i-«,rA} 

X i£,-_i(l-u,-_i)  dU,-! 

(1  -  W,_iJ 

-71  ■•■rnWi-t./^;} 

X     /(F-(l -«.-))    /?,(1-U,)       ^ 

=  &(£_!  +  1)  -  (C,  +  1), 

which  is  just  a  re-expression  of  (3.2.12)  or  (3.2.14)  since  c,  and  C,  are  of  the  same 
form  after  integration  over  u,+1, . . . ,  uk. 

Now  if  R{  =  1,  we  have  A;  =  i,  and  mjt  =  0.  Here,  the  condition  (3.2.12)  reduces 


to 


ck  =  Cjt_] 


The  relation  (3.2.17)  can  be  verified  as  follows: 


ri      [/(F-'(l-tifc):A) 
"4-  I    /(F-Ul-^)) 


(3.2.17) 


x  /  ...  /  n{^i(i-«i)m>^}^* 

=  /'     ...  1°  [^"'(l-ttfc-i):^)  _  x" 

iu*_2  7l      [  (1  -  Ufc_l) 

Jfc-2 

x%_1)(i  -  u(fc_1))i+A'*-'  n^ii-tyr^jdui 

=  /'  /•°rF(F-1(l-u,_1):A)      ^ 

A*_a        A     [  (1  -  Ujt-i) 

fc-2 
Xi2fc_!(l  -  Uk.x)R^-X  XliRjil  ~  Uj)midUj}duk 

=  Ck-i- 


j=i 
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Stochastic  Integral  Form  for  u 


The  statistic  at  (3.2.10)  can  be  readily  translated  into  the  form  of  a  stochastic 
integral.  Recall  that  the  control  group  sample  is  labelled  sample  0;  the  treatment 
group  sample  is  labelled  sample  1. 

Recall  the  notation  defined  in  Section  2.2,  and  recall  the  definition  of  indicator 
function  1(A)  =  1  or  0,  according  to  whether  the  event  A  occurs  or  does  not  occur. 
For  the  groups  r  =  0, 1,  with  survival  time  random  variables 

Xrj,       j  =  l,...,Nr, 

and  censoring  indicators  (as  defined  in  Section  2.2) 

8rj,     j  =  l,...,Nr, 
let 

Mr{x)  =  Y.AXr3<X,8r3  =  l). 
.7  =  1 

Note  MT(x)  is  the  number  of  failures  for  group  r  that  occur  no  later  than  time  x. 
Also,  let 

Yr{x)  =  YJl{Xr,>x), 
j=\ 

the  size  of  the  risk  set  for  group  r  at  time  x.  Note  that  Yr(x(^)  =  RT{  in  our  earlier 
notation  for  risk  set  sizes.  Let 


Y1(x)  +  Y0(x)  =  Y(x), 


Mi(x)  +  tf0(x)  =  Af{x), 
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and 


dK(x)=K(x)-K(x-). 


The  assumptions  of  continuous  survival  distributions  implies  that  the  probability 
of  ties  among  the  X(j)  is  zero.  Thus  with  probability  1, 

( Z(i),    if  *  =  «(<); 


dW,(x)  = 


and 


0,        otherwise, 


<M/"0(x) 

0,  otherwise. 

So  a  stochastic  integral  expression  for  v  can  be  written  as 

E(      Ru  +  Roi      Ru\ 
^VU—Ri -R-) 


s  !>»•  ( -5-*(o  -  ir^-Hid) 


1=1 


^0 


Ri 

Ru 
Ri 


=Su;,^)Z(,)"Su;,(ii)(1"2(,)) 


f°° 

=   /      u;(x) 

./— oo 

/oo 
u;(x) 
-oo 


Ko(ar) 


y(x) 

Y0(x)Y1(x) 


(iA/i(x)  -  /     w{x) 
Jo 


Yi(x) 


Y0(x)  +  Y1(x) 


IY(*)\ 

dM^x)       <UV0(x) 
Yi(x)         YQ(x) 


djV0(x) 


(3.2.18) 


where 


w(x)  =  C,  -  C,,       X(.)  <  x  <  x,+1. 

3.3     Conditions  for  Asymptotic  Properties  Under  Hp 

The  formulation  of  v  as  a  stochastic  integral  allows  us  to  apply  the  martingale 
theory  of  Gill  (1980)  with  suitable  adaptations.    His  work  includes  the  statement 


of  certain  conditions  that  are  sufficient  to  derive  an  asymptotic  distribution  for  a 
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properly  standardized  version  of  v.  Recall  the  definitions  of  <f>(u)  and  <I>(u)  at  (3.2.4) 
and  (3.2.5),  respectively.  The  notation  of  <f>(u)  and  $(u)  as  score  functions  that  cor- 
respond to  uncensored  and  censored  observations,  respectively,  was  introduced  by 
Prentice  (1978)  in  the  framework  of  regression  models.  Subsequently,  the  notation 
has  been  used  in  the  area  of  censored  data  linear  rank  statistics  (Harrington  and 
Fleming,  1982;  Cuzick,  1985).  Thus  we  adopt  the  notation  for  the  remainder  of  this 
dissertation. 

In  this  section  we  lead  up  to  sufficient  conditions  that  permit  investigation  of  the 
asymptotic  properties  of  the  censored  data  test  based  on  v.  In  the  process  we  will 
show  that  for  large  samples,  the  expected  scores  c,  and  C,  in  v  can  be  respectively 
replaced  with  approximate  scores  that  are  easy  to  compute. 

Approximate  Scores 

The  computational  complexities  associated  with  linear  rank  procedures  that 
use  expected  value  scores  c,  =  £[<f>( a,-)]  has  long  been  recognized  in  the  literature. 
For  uncensored  data  linear  rank  procedures  designated  to  detect  a  shift  in  a  dis- 
tribution, for  example,  the  normal  (van  der  Waerden,  1953)  approximate  scores 
c,  =  F-1(l  —  77V7),  where  1  —  F  is  the  cumulative  distribution  function  of  the 
standard  normal  distribution,  are  much  easier  to  evaluate  than  the  expected  scores. 
In  fact,  the  normal  expected  scores  are  the  £(Z^)),i  =  1, . . . ,  N,  where  Z^  is  the 
zth  order  statistic  in  a  random  sample  of  size  N  from  the  standard  normal  distri- 
bution. These  scores  involve  integrals  that  do  not  have  closed-form  expressions. 
More  so  in  the  censored  data  case,  the  expected  scores  involve  integrals  that  do 
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not  have  streamlined  closed  forms  and  require  numerical  integration  methods  to  be 
evaluated. 

Prentice  (1978)  conjectured  that  the  expected  scores  c,  and  Ct-  from  the  rank 
likelihood  he  derived  could  be  approximated  by  the  respective  scores  ^(1  —  F(xii\)) 
and  $(1  —  F(x(,))).  A  subsequent  article  by  Cuzick  (1985)  confirms  the  truth  of  the 
conjecture  with  the  requirement  that  some  fairly  mild  conditions  on  the  uncensored 
score  function  <f>(u)  be  satisfied.  The  resulting  asymptotic  equivalence  of  the  linear 
rank  tests  based  on  the  two  types  of  scores  is  one  in  the  sense  that  the  distance 
between  the  two  test  statistics  converges  to  zero  in  absolute  mean  as  N  — ►  oo. 
Before  stating  Cuzick's  (1985)  main  result,  we  state  two  lemmas. 

LEMMA  3.3.1    The  following  relationship  holds  for  i  =  1, . . . ,  k: 

1    /i 

<b(u)  =  -  I    <j>{v)dv. 

u  Ju 

Proof.     Utilize  the  transformation  v  =  I  —  F(u)  to  get 
/    <f)(y)dv 

Ju 

=  -\F{F-\\-v):  A)]*  -(1-u) 

=  -F(F-\0)  :  A)  +  F(F-\1  -  u)  :  A)  -  (1  -  u) 

=  F{F-\l-u):  A)-(l-u), 

so  that  dividing  both  sides  by  1  —  u  and  substituting  with  the  relation 
F(F~l(l  —  u))  =  1  —  u  completes  the  proof.  □ 
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Lemma  3.3.1  provides  a  well-defined  functional  relationship  between  uncensored 
and  censored  score  functions.  Mathematical  conditions  on  <j>(u)  such  as  continuity 
will  imply  similar  ones  for  $(u). 

Consider  the  following  three  sample  quantities  to  estimate  the  survival  function 
F(x): 


pKM{x)  =     n  (fci) 


R  ' 

xu)<x  "■! 


(3.3.1) 


FP(x)    =      II    TJTTTy  (3-3-2) 

xU)<x  \Hi  +  x) 

FA(x)    =    exp(-  £    1),  (3.3.3) 

xu))<x       3 

where  R3  is  the  size  of  the  risk  set  at  time  X(j). 

The  estimator  FKM  is  the  so-called  product-limit  estimator  (Kaplan  and  Meier, 
1958).  The  estimator  Fp  is  discussed  in  Prentice  (1978),  and  FA  was  proposed  by 
Altshuler  (1970).  For  generating  censored  data  linear  rank  test  statistics  in  the  case 
of  location  alternatives,  a  judicious  choice  from  these  three  estimators  has  led  to 
nice  closed-form  expressions  for  expected  and  approximate  scores  (Prentice,  1978; 
Harrington  and  Fleming,  1982).  In  numerical  computations  for  large-samples,  any 
one  of  the  three  estimators  is  reasonable.  The  actual  relationship  (Cuzick,  1985) 
among  the  three  is 

FKM{x)  <  FA(x)  <  Fp{x)   for  all   x. 

Unless  specified  otherwise,  F  will  be  the  general  representation  for  the  three 
estimators.  One  danger  in  the  use  of  (3.3.1)  is  when  it  takes  on  the  value  0  and  causes 
an  infinite-value  approximate  score.    This  situation  may  arise  for  the  normal  case 
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for  example,  since  both  the  uncensored  and  censored  score  functions  are  unbounded 
as  u  — ►  1.  One  remedy  in  practice  is  to  replace  an  occurrence  of  0  with  a  value  of 
0.001  for  instance,  but  this  is  a  very  subjective  device  that  can  cause  considerable 
difference  in  values  of  the  approximate  scores.  Harrington  and  Fleming  (1982) 
employ 

p™{x-)=  n  i-^1, 

xu)<x  nj 

which  takes  on  values  in  the  interval  (0, 1].  Both  (3.3.2)  and  (3.3.3)  strictly  take 
on  values  in  the  interval  (0, 1).  Also,  (3.3.2)  can  be  regarded  as  a  generalization  for 
the  value 

1-       *' 


N  +  l 

that  is  always  used  in  the  literature  for  approximate  scores  on  uncensored  data.  So 

Fp  is  the  estimator  of  choice  in  practice  for  this  manuscript. 

Theorem  3.3.1  below  is  a  modified  statement  of  Theorems  1  and  2  of  Cuzick 
(1985).  The  theorem  provides  us  with  a  set  of  sufficient  conditions  for  the  difference 
between  v  based  on  the  approximate  function  scores  and  v  based  on  the  expected 
scores  to  be  negligible  in  the  limit  as  N  —»  oo.  The  sufficient  conditions  are  fairly 
mild;  the  scores  that  make  up  some  of  the  well  known  linear  rank  statistics  such  as 
the  log-rank,  Peto  &  Peto,  and  Harrington  k  Fleming's  (1982)  G",  0  <  p  <  1  class 
satisfy  them  (Cuzick,  1985). 

THEOREM  3.3.1   Assume  the  following  conditions  on  the  score  function  <j){u). 

C-l   The  uncensored  score  function  <f>{u)  is  twice  continuously  differen- 
tiate on  (0, 1)  with  first  and  second  derivatives  <j>'(u)  and  4>"(u). 
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C-2 


n 

\u<f>'(u)\  +  \u24>"{u)\  <  —  for  some  a  <  1/2  and  B  <  oo. 


C-3 


lim  iVVarf— v)  >  0. 


Then  the  scores  c<  and  C{  in  (3.2.8)  &  (3.2.9)  converge  in  probability  to  <f)(l  —  F(x}) 
and  $(1  —  F(x))  as  N  — >  oo,  respectively.  F(x)  can  be  either  (3.3.1),  (3.3.2),  or 
(3.3.3). 

Proof.  Refer  to  Theorems  1  and  2  of  Cuzick  (1985).  The  fact  that  d>(u)  can  be 
differentiated  twice  with  continuity  maintained  will  imply  the  same  properties  for 
$(u)  from  Lemma  3.3.1.  □ 

The  general  forms  of  the  derivatives  of  (f>(u)  are  messy,  so  that  the  score  functions 
for  specific  distributional  choices  must  be  checked  to  see  if  C-l  and  C-2  are  satisfied. 
The  non-vanishing  Condition  C-3  on  the  variance  will  follow  from  the  structure 
obtained  when  asymptotic  properties  of  v  are  established  in  Section  3.4.  Cuzick 
(1985,  Theorem  3)  indicates  that  C-3  is  satisfied  for  such  a  structure  of  the  variance. 

Gill's  Conditions 

Theorem  3.3.1  brings  us  to  the  point  where  the  martingale-based  results  of  Gill 
(1980)  can  be  used  to  establish  an  asymptotic  variance  for  v  as  well  as  normality 
(under  the  null  hypothesis  of  equal  survival  functions)  for  its  asymptotic  distribu- 
tion. The  usual  random  censorship  model  (cf.  Section  2.2)  is  assumed. 
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The  stochastic  integral  form  for  v  in  (3.2.18)  can  also  be  considered  in  the  form 


where 

Y0(x)Y1(x) 


K{x)  =  w(x) 


(3.3.5) 


[Yo(x)  +  Yi(x)\ 

is  a  weight  function  in  the  class  K,  of  Gill  (1980).  Gill  (1980)  defines  members  A"  €  K. 
as  a  function  of  the  observations  that  equals  zero  whenever  min(Vo(x),yi(x))  =  0. 
The  paper  of  Andersen,  Borgan,  Gill,  and  Keiding  (1982)  shows  that  the  class 
of  statistics  from  Prentice  (1978)  that  uses  the  Fp  estimator  can  be  written  as 
a  stochastic  integral  of  the  form  (3.3.4)  with  K(x)  as  in  (3.3.5).  Moreover,  the 
Prentice  (1978)  class  is  a  subset  of  the  Prentice  and  Marek  (1979)  class  based  on 
the  "preservation  of  scores"  condition  at  our  (3.2.12).  The  Prentice  and  Marek 
(1979)  class  can  likewise  be  written  in  the  framework  of  (3.3.4)  and  (3.3.5). 

Gill  (1980,  p.  72)  provides  three  conditions  that  are  sufficient  for  the  asymptotic 
normality  of  test  statistics  of  the  stochastic  integral  form  (3.3.4).  These  sufficient 
conditions  can  be  applied  with  many  distributional  forms  for  the  survival  and  censor- 
ing variables,  and  any  test  statistics  with  weight  functions  K(x)  that  are  members 
of  the  general  class  JC.  We  state  Gill's  (1980)  conditions  as  they  are  relevant  to  our 
case  under  the  null  hypothesis. 

Let  X*  be  the  set  with  membership 

{  x  |  mm{F0(x)  >  0,  Fx{z)  >  0)  },  (3.3.6) 


56 

where 

F0(x)    =    F(x)-Lo(x), 
A(s)    =    G{x)-Lx{x). 

Let  Ar(x),r  =  0, 1  be  the  cumulative  hazard  functions  associated  with  the  control 
and  treatment  groups,  respectively. 

1.  Forr  =  0,1: 

a.  AT(x)  is  finite  on  J*. 

b.  K2(x)/YT(x)  converges  uniformly  in  probability  to  hr(x)  as  N  — >  oo  on  each 
closed  subinterval  of  J*.  The  function  hT(x)  is  left-continuous,  bounded,  and  has 
right-hand  limits  on  each  closed  subinterval  of  X* . 

c.  YT{x)  — >  oo  in  probability  as  N  — >  oo  for  each  x  £  J*. 

II.  Let  S  =  sup  J*.  If  S  $  I*,  then  for  r  =  0, 1: 


/    hr(x)dAT(x)  <  oo. 
Ji- 

b. 

hmJimoSupPr( J      K2 (t) / Yr(t)dAr(t)  >  e)  =  0 

for  every  e  >  0. 
III.  IfS<  oo,  then  for  r  =  0,1: 

/        K2(x)/Yr(x)dAr(x) 

J(S,oo) 

converges  in  probability  to  0  as  N  — ►  oo. 
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When  conditions  I,  II,  and  III  hold,  two  consistent  estimators  for  the  variance  of  v 


are 


1     r 

r=0J~ 


K\x) 


Yr(x) 


dMjx) 

Yr(x) 


(3.3.7) 


and 


V* 


-I 
=  /: 


-°°r=0 


K\x) 


Yr(x) 
K\x) 


[Yi(x)Y0(x)\ 


d(Ar1(x)+AfQ(x)) 
Yl(x)  +  Y0{x) 

d(M(x)+Af0{x)). 


(3.3.8) 


The  variance  estimators  at  (3.3.7)  and  (3.3.8)  are  suggested  by  Gill  (1980,  p.  47). 
Lemma  4.3.1  of  Gill  (1980)  establishes  that  both  (3.3.7)  and  (3.3.8)  are  consistent 
estimators  under  Ho  for  the  true  limiting  variance  of  v  based  on  the  form  (3.3.4). 
Gill  (1980)  notes  that  there  is  no  true  ordinal  relationship  between  V\  and  V?,  and 
we  would  expect  them  to  be  fairly  equal  in  practice  for  large  samples.  For  the  null 
hypothesis  case,  there  is  some  creedence  to  choosing  V-i  on  the  basis  that  it  utilizes 
both  samples  to  get  a  pooled-type  of  estimator  of  the  common  cumulative  hazard 
function  A(x). 

In  order  to  verify  the  three  conditions  of  Gill  (1980)  and  thus  obtain  variance 
expressions  and  asymptotic  normality  of  i/,  we  consider  some  mathematical  results. 

LEMMA  3.3.2   (A  continuous  function  preserves  uniform  convergence.) 
Let  {/yv(x)}  be  a  sequence  of  random  functions  that  converges  uniformly  in  prob- 
ability to  f(x)  as  N  — *  oo  for  x  6    some  set  E.    If  g(-)  is  a  continuous  function 
on  the  range  of  {/jv(i)},  then  5f(//v(2r))  is  uniformly  convergent  in  probability  to 
g{f(x))  as  N  — ►  oo  for  x  £  E. 
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Proof.     We  know  that  for  arbitrary  8  >  0, 

Pr(sup  \fN(x)  -  f(x)\  >  8)  -»  0  as  N  -*  oo. 

Let  e  >  0  be  given.  Since  g(-)  is  continuous,  we  can  find  8  >  0  such  that  |<7(//v(z))  — 
<7(/(x))|  <  e  if  \/n(x)  —  f(x)\  <  8.  Thus  for  arbitrary  e, 


Pr(sup\g(fN(x))-g(f(x))\>e)    =    1  -  Pr(sup  \g(fN(x))  -  g(f(x))\  <  e) 

xeE  xeE 

<     l-Pr(sup|/jv(x)-/(x)|<6) 

=    Pv(sup\fN(x)-f(x)\>8) 
xeE 

^0 


as  N  — ►  oo.  □ 

LEMMA  3.3.3  Le£  {/1at(x)}  6e  a  sequence  of  random  functions  that  is  uniformly 
convergent  to  A(x)  in  probability  as  N  — >  oo.  Let  {Bn(x)}  also  be  a  sequence 
of  random  functions  that  uniformly  converges  to  B(x)  in  probability  as  N  — ♦  oo. 
Assume  that  A(x)  and  B(x)  are  bounded  for  all  x  £  E.  Then  A^(x)B^(x)  converges 
uniformly  in  probability  to  A(x)B(x)  as  N  — »  oo. 

Proof.     Let  e,  c  >  0.  Then 


Pr(sup  \{AN(x)  -  A(x))(BN(x)  -  B(x))\  >  e) 

xeE 

=  Pr(sup  \(AN(x)  -  A(x))(BN(x)  -  B(x))\  >  e,  sup  \BN(x)  -  B(x)\  <  e/c) 
xeE  xeE 

+  Pr(sup  \{AN(x)  -  A(x))(BN(x)  -  B(x))\  >  e,  sup  \BN{x)  -  B(x)\  >  e/c) 
xeE  xeE 

<  Pr(sup\(AN{x)-A(x))\  >  c) 
xeE 
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+  Pr(sup  \BN{x)  -  B(x)\  >  e/c) 
xeE 

— ►  0  +  0  =  0.  (3.3.9) 


The  boundedness  condition  implies  that  |^4(a;)|   <   M  for  some  positive  number 
M  <  oo.  It  is  also  assumed  that 


Pv(snp\BN(x)-B(x)\>^-)^0. 
xeE  M 


Combining  these  last  two  facts  with  the  relation 


Pr(sup  \A(x)[BN(x)  -  B(x)}\  >  e)  <  Pr(sup  \BN(x)  -  B{x)\  >  -^) 
xeE  xeE  M 


implies  the  result 

Pr(sup  \A(x)[BN(x)  -  B(x)}\  >  e)  — >  0  (3.3.10) 

xeE 

as  N  — >  oo.    Commuting  the  roles  of  the  two  sequences  {A^(x)}  and  {Bn(x)} 

similarly  gives 

Pr(sup  \B{x)[AN(x)  -  A(x)}\  >  e)  — >  0.  (3.3.11) 

xeE 

Next  employ  the  identity 

AN(x)BN{x)  -  A(x)B(x) 

=  (AN(x)  -  A(x))(BN(x)  -  B(x))  +  A(x)(BN(x)  -  B(x)) 
+B(x)(AN(x)-A(x)) 

along  with  (3.3.9)— (3.3.11)  to  give 


Pr(sup  \AN(x)BN(x)  -  A(x)B(x)\  >  c) 
xeE 
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<  Pr(sup|[^(x)  -  A(x)][BN(x)  -  B(x)}\  >  f) 
xeE  o 

+  Pr(sup|[AJV(x)  -  A(x)]B(x)\  >  |) 

x£E  O 

+  Pr(sup  \A(x)[BN{x)  -  B{x)}\  >  £) 

z€E  o 

— >0+0+0=0 
as  tV  — >  oo,  which  is  the  desired  result.    □ 

3.4     Asymptotic  Normality  of  v 

Recall  that  Theorem  3.3.1  provides  a  set  of  sufficient  conditions  whereby  the 
expected  scores  in  the  censored  data  rank  statistic  v  can  be  replaced  by  approx- 
imate scores  for  large  samples.  The  test  statistics  considered  by  Gill  (1980)  in 
establishing  asymptotic  properties  can  all  be  regarded  as  censored  data  linear  rank 
statistics  based  on  approximate  scores.  Therefore,  in  the  development  of  asymptotic 
properties  for  u  that  follows,  we  will  consider  the  version  of  v  that  is  based  on  the 
approximate  scores,  i.e. 

v  =  £>(|-)<K1  -  F(x{t)))  +  Mi*(l  -  F(x(i))). 
i=i 

Under  the  conditions  of  Theorem  3.3.1,  the  same  properties  for  v  based  on  the 

expected  scores  (as  in  (3.2.2))  will  follow. 
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Verifying  the  Conditions 

Gill  (1980)  used  the  log-rank  statistic  as  a  prominent  member  of  the  class  K. 

Among  the  many  properties  he  explored  for  this  member,  he  showed  the  meeting  of 

the  three  conditions  listed  on  page  56  of  Section  3.3  that  are  sufficient  for  a  limiting 

normal  distribution  of  the  properly  standardized  version  of  the  statistic  under  the 

null  hypothesis  of  equal  survival  distributions.    In  the  form  (3.2.18)  for  i/,  setting 

w(x)  =  1  gives  the  log-rank  statistic.    The  corresponding  weight  function  in  the 

class  fC  is 

„,,         Y0(x)Yl(x) 

Kc{x)  =  ymTWY  (3A1) 

If  we  multiply  Kc(x)  by  the  weight  function 

^(x)  =  /(^-Wf)):A)_F(F-(F(f)):A) 

/(F-'(F(x)))  F(F-'(F(x)))  l         ' 

for  our  model  at  (2.2.4),  we  get  a  weight  function  K(x)  that  is  a  member  of  Gill's 
(1980)  class  K..  The  corresponding  function  w  is  random  since  it  is  a  function  of 
the  approximate  scores  4>{\  —  F)  and  $(1  —  F).  Note  that  w(x)  at  (3.3.5)  is  not 
random  as  it  is  a  function  of  the  c,  and  C,.  Convergence  properties  associated  with 
w(x)  at  (3.4.2)  in  combination  with  results  already  well-established  (Gill,  1980)  for 
the  log-rank  statistic  will  provide  the  course  for  verifying  the  three  conditions  that 
are  sufficient  for  asymptotic  normality  of  v. 

LEMMA  3.4.1  Suppose  that  f(t  :  A),  A  >  0  and  f(t)  =  f(t  :  0)  are  continuous 
densities  with  support  over  the  interval  (—00,00).    If  Ai  <  00  and  Ai  <  00  are 
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positive  constants  such  that 

fit  :  A)  fit  :  A) 

t-*oo       /(f)  t-*-oo       /(f) 

then  w(x)  defined  at  (3.4.2)  is  bounded  on  the  interval  (—00,00). 
Proof.     By  L'Hopital's  rule, 

t-00     F(t)  *-«)     /(f) 


Therefore, 


f/(f:A)       F(t:A)\  f{x  :  A)  /(f  :  A) 

hm  <  — — — =— - —  >     =  hm  — — — - hm  — — — — 

<-oo\      /(f)  F(t)      }  ^00       /(X)  t^co       f(t) 

=  A1-A1  =  0. 


Also, 


(3.4.3) 


|/(^A)_£^A)|    m  /(^A)_  F^A) 

<— <*>\    /(f)  F(f)    J  '—00     /(f)        ^-<*>     F(f) 

=    A2  -  1  <  00. 

Now  since 

/(<  :  A)      F(t :  A) 

is  continuous,  then  for  any  finite  values  a,  6,  the  function  (3.4.3)  is  bounded  on  every 
finite  interval  [a,b].  Since  (3.4.3)  is  finite  at  the  points  —00  and  00,  the  substitution 

t  =  F-\F(x)) 

shows  that  w(x)  at  (3.4.2)  is  bounded  on  (—00,00).  □ 

The  value  of  the  Lemma  3.4.1  will  be  seen  when  we  choose  specific  forms  for 
F(x  :  A),  but  it  should  be  noted  that  it  does  not  cover  the  case  where  both  the 
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censored  and  uncensored  score  functions  tend  to  infinity  as  x  tends  to  infinity.  For 
this  case,  the  "indeterminate"  form  oo  —  oo  for  the  "limit"  of  w(x)  requires  special 
handling.  An  example  of  this  occurs  in  Chapter  Four. 

THEOREM  3.4.1  (Uniform  convergence  in  probability  of  the  Kaplan-Meier  esti- 
mator.) Assume  the  model  outlined  in  Section  2.2,  with  the  support  of  G(x)  to  be 
(—00,00).  Let  s  €  (  —  00,00)  be  such  that  the  risk  set  size 


Y(s)  =  Y0{s)  +  Y1(s)-^oo.  (3.4.4) 


Then  under  the  null  hypothesis  of  F(x)  =  G(x)  for  all  x,  the  Kaplan-Meier  estima- 
tor FKM  converges  uniformly  on  x  G  [—00,5]  in  probability  to  F  as  N  — +  00.  That 
is, 

FKM(X)  -  F(x) 


sup 

— oo<i<s 


Ao. 


Proof.  See  Theorem  4.1.1  of  Gill  (1980),  who  proves  a  more  general  case.  His 
notation  uses  F(x)  for  the  cumulative  distribution  function.  The  relation  (3.4.4), 
which  is  given  as  Condition  Ic  of  Gill  (1980),  has  the  interpretation  that  the  size  of 
the  risk  set  at  any  point  x  grows  to  infinity  as  N  — ►  00.  □ 

If  the  support  of  the  censoring  distributions  is  also  (—00,00),  then  5  =  00. 
Sometimes,  however,  censoring  distributions  are  assumed  to  have  support  of  the 
form  (—00,5),  where  s  <  00,  and  examples  can  be  constructed  to  show  that  the 
uniform  convergence  of  FKM  fails  for  x  >  s. 

COROLLARY  3.4.1  Theorem  3.4.I  holds  for  FKM(x)  replaced  by  either  Fp{x)  or 
FA{x). 
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Proof.      The  case  for  FA  is  also  covered  in  Gill's  (1980)  Theorem  4.1.1.   For  the 
estimator  Fp ,  first  consider  the  following  relation  from  the  lone  lemma  of  Cuzick 


(1985): 


\fp(x)-Fkm(x)\<Fp(x) 


Y(x) 


(3.4.5) 


Then 


sup 

—  oo<i<s 


Fp(x)  -  F(x) 


sup 

— oo<:r<s 


<        sup 

—  00<X<3 


Fp{x)  -  FKM(x)  +  FKM(x)  -  F(x)\ 
Fp(x)  -  FKM(x)\ 


+     sup     \FKM(x)-F(x)\ 


—  oo<i<a 


0  +  0  =  0 


since 


sup 

—  00<X<3 


FhM(x)-F(x) 


by  Theorem  3.4.1  and 


sup 

-oo<x<* 


F'{x)-F*M(x) 


<        sup 

— oo<x<s 


Fp(x) 


Y(x) 


<    2/     inf     Y(x) 

—  <X><X<3 


0 


by  (3.4.4)  and  (3.4.5).    □ 

Using  these  last  two  results,  we  now  consider  a  key  property  for  the  weight 
function  w(x)  at  (3.4.2). 

THEOREM  3.4.2  Assume  the  conditions  of  Theorem  3-4-1  and  Lemma  3-4-1- 

Then  the  weight  function  w(x)  represented  by  (3.4.2)  is  bounded  and  w  converges 
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uniformly  in  probability  to  w"  on  (  —  00,00)  as  N  —*  00,  where 

f(x  :  A)       F(x  :  A) 

w  {x)  =  ~W) — m~-  (3A6) 

Proof.    Since  F(x)  is  assumed  continuous,  its  inverse  function  F~1(x)  is  also  con- 
tinuous. The  function 

/(/-(■):  A)       F(F-'(-)  :  A) 
W{)  =      /(*-<■))      "      F(f-(-»  (3'4J) 

will  also  be  continuous  since  it  is  a  composite  function  of  continuous  functions.  The 

survival  function  estimator  F  takes  on  values  in  the  interval  [0, 1],  so  Lemma  3.4.1 

insures  that  w(x)  is  bounded  when  it  is  defined  on  the  extended  line  [—00,  00]. 

Applying  Lemma  3.3.4  with  (3.4.7)  in  the  role  of  the  continuous  function  and  F(x) 

as  the  argument,  where  F  has  the  uniform  convergence  properties  in  Theorem  3.4.1 

and  Corollary  3.4.1  proves  the  uniform  convergence  of  w(-).  □ 

As  mentioned  previously,  the  three  conditions  of  Gill  (1980)  have  been  discussed 

and  verified  for  the  member  Kc{x)  which  corresponds  to  the  log-rank  statistic. 

Actually,  Gill  (1980)  attaches  a  standardization  factor  on  to  the  weight  function  at 

(3.4.2)  to  get 


The  multiplicative  factor  is  attached  to  ensure  that  the  variance  of  the  test  statistic 
corresponding  to  Kc(x)  is  bounded  away  from  0  and  00  as  N  — ►  00.  Note  that  this 
provides  a  test  statistic 


No  +  AT, 
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where  v  remains  defined  as  in  (3.2.10)  and  (3.2.18).  The  variance  estimators  V%  and 
K2  respectively  given  by  (3.3.7)  and  (3.3.8)  can  be  adjusted  accordingly,  and  their 
consistency  for  the  variance  of  v*  is  established  in  Lemma  4.3.1  of  Gill  (1980).  That 
lemma  is  a  precursor  to  Gill's  (1980)  Corollary  4.3.1  and  Proposition  4.3.3,  which 
specify  the  asymptotic  distribution  for  the  test  statistic  based  on  Kq{x), 
The  standardization  factor  of  Gill  (1980)  is  specifically 


I N0  +  AT, 


V    NoN-i    ' 
The  previous  paragraph  defined  the  relationship  between  v  and  v* .  Suppose  there 
exists  a  variance  of  */*,  Var(i/*),  such  that 


>/Var(i/-) 
is  a  random  variable  with  some  asymptotic  distribution.    Also  suppose  that  the 

variance  estimators  V\  and  Vj  are  such  that 

1  N0NX      J 
m    No  +  th 

2  N0NX      2 

are  consistent  estimators  of  Var(i/*).  By  Slutsky-type  arguments, 

v"  v*  v* 

and       g=    and 


will  have  the  same  asymptotic  distribution.  But  note  that 


>/Var(*-)  y^v/Var^)' 

v"  v 

7W  =  7W' 
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so  there  are  numerous  expressions  of  random  variables  that  are  asymptotically  equal 
in  distribution. 

The  proofs  and  surrounding  discussions  of  Gill's  (1980)  Corollary  4.3.1  and 
Proposition  4.3.3  note  that 

Yr(x) 


sup 

— oo<x<oo 


-  Fr(x) 


^0;     r  =  0,l,  (3.4.9) 


where  Fr(x)  =  Pr(Xri  >  x)  as  introduced  in  Section  2.2,  and 

min(W0,JV1)-»oo;  J^Jf  -*  Pr  e  (0,1)  (3.4.10) 

are  sufficient  to  establish  conditions  I— III  for  various  censorship  models  and  thus 
the  asymptotic  distribution  of  the  log-rank  statistic.  The  Glivenko-Cantelli  theorem 
can  be  used  to  show  that  (3.4.9)  is  satisfied  for  the  mixture  model  at  (2.2.4)  under 
the  random  censorship  model.  (Gill,  1980;  Harrington  and  Fleming,  1982). 

THEOREM  3.4.3  Assume  the  random  censorship  model  described  in  Section  2.2, 
and  let  (3.4.10)  obtain.  Suppose  w(x)  is  defined  by  (3.4.2),  Kc[x)  is  defined  by 
(3.4.1),  and  Kq(x)  is  defined  by  (3.4.8).  Let  I*  be  defined  as  in  (3.3.6),  and  suppose 
w(x)  is  bounded  on  J*.   Then  the  test  statistic 


will  satisfy  Conditions  I-IH  on  page  56  as  they  are  required.  Define 


Var(i/")  =  f 

J — i 


f{x:  A)      F{x  :  A) 
f(x)  F(x) 


2  r 


F0(x)F1(x) 


poFo(x)  +  p%Fi(x) . 


dA(x).       (3.4.11) 


Recall  the  mixture  model  at  (2.2.4).    Under  the  null  hypothesis  tt  =  0, 


>/Var(^)        7Sr\/Var(^) 
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converges  in  distribution  to  a  standard  normal  random  variable  as  N  — ►  oo.   Fur- 
thermore, both  estimators 

Yl-r(x) 


/oo 
w\x) 


Y0(x)  +  Y1(x) 


<Wr(x) 


and 


V, 


/oo 
w\x) 
-oo 


Y0(x)Y1(x) 


[(Y0(x)  +  Yi(x))* 


d(Af0(x)+Ar1(x)) 


will  be  such  that 


V\    and    — — — —  V<i 


N0N,    "    """      N0Ni 
converge  in  probability  to  Var(z/*)  as  N  — ►  cxd. 

Proof.  Under  the  null  hypothesis  w  =  0  and  the  random  censorship  model  outlined 
in  Section  2.2,  Conditions  (3.4.9)  and  (3.4.10)  are  sufficient  for  conditions  I— III  to 
hold  for  Kc(x).  We  start  with  verification  of  Condition  la.  For  those  values  of  x 
in  T  such  that  mm(F0(x) ,  Fi(x))  >  0,  both  F(x)  >  0  and  G(x)  >  0.  Thus  the 
corresponding  cumulative  hazard  functions  will  be  less  than  infinity,  i.e.  finite  on  I. 
In  Condition  lb,  let 


K\x)jYr{x)  =  w\x) 
Theorem  3.4.2  and  Lemma  3.3.2  imply  that 

w2(x) 


Yr(x) 


.2,  N  p   f/(x:A)      F(x:A) 


I     fix)  F(x)     J 

uniformly  on  [—00,00)  as  N  — *  00.   The  uniform  convergence  will  hold  on  closed 
subintervals  of  J*  as  well.  Applying  Lemma  3.3.2  with 


Ah(x)  =  w(x) 
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and 


BN(x)  =  K'c(x) 


gives 


K2(x)/YT(x)  -U  hT(x);     r  =  0,l 


where 


hr(x) 


f(x:A) 
/(*) 


F(x:A)}2  ft_r 
F(*)     J    Fr(x) 


F0(x)F1(x) 


poFo(x)  +  piFi(x) 


r  =  0,l, 

(3.4.12) 


uniformly  on  closed  subintervals  of  J*  as  N  — ►  oo.  Let 


K(x)  =  4 


Pl-r 


F0(x)F1(x) 


1  2 


Fr(x)  [p0F0(x)  +  p1F1(x)\ 
be  the  corresponding  function  for  K^(x).    Then  hr  is  the  product  of  h^  and  the 
square  of  expression  (3.4.3)  from  Theorem  3.4.2.  It  is  the  product  of  two  bounded 
left-continuous  functions  with  right  hand  limits,  and  thus  is  itself  a  bounded  left- 
continuous  function  with  right  hand  limits. 

Condition  Ic  is  covered  automatically  by  the  random  censorship  model  (Gill, 
1980). 

Theorem  3.4.2  implies  there  is  a  positive  number  Mi  such  that 

ff(x:A)       F(x:A)'2 


f{x)  F{x) 


<  Mi  for  all  x  6  [—00,00], 


and  therefore 


j£.M.XM»)  =  £{7§-f$}\°M*M-> 

<      f    Mi-  h°(x)d\r(x)  <  00, 
Jl* 
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so  that  Condition  Ha  would  be  satisfied  if  S  £  X*,  since 

/   hf(x)dAr(x)  < 


CO 


was  established  for  the  log-rank  statistic  in  Gill  (1980).    Analogously,  there  is  a 
positive  number  M  such  that  w2(x)  <  M,  so  Condition  lib  follows  from 

limlimsuPPr(  /    K2(x)/Yr(x)dAr(x)  >  t) 

<  limlimsuPPr(  /    MK^(x)/Yr(x)dAr(x)  >  t) 

x]S     Af— oo  Jx 

=  limlimsupPr(M  /    I<g(x)/YT(x)dAr(x)  >  e) 

rT«5     iV— oo  Jx 

=  limlimsupPr(  /   K$ (x)/Yr(x)dAr(x)  >  4j)  =  0, 

x]S     ^_oo  Jx  M 

the  last  step  verified  by  Gill  (1980). 

The  bound  M  used  in  the  verification  of  Condition  II  also  is  central  to  a  veri- 
fication of  Condition  III  if  S  <  00.  Finally,  the  expression  for  Var(i/*)  comes  from 
substitution  of  (3.4.12)  into  Gill's  (1980)  expression 

/■oo 

/     (hi(x)  +  h0(x))dA{x) 

J— oo 

in  his  Corollary  4.3.1.  The  expressions  for  the  estimators  V\  and  V2  are  determined 
after  substitution  of 

w(x)  •  Kc{x) 

for  K{x)  in  (3.3.7)  and  (3.3.8).  The  convergence  associated  with  Vi  and  V2  follows 
from  Gill's  (1980)  Lemma  4.3.1.  □ 
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COROLLARY  3.4.2   Under  the  conditions  of  Theorem  3.4-3, 


and 


converge  in  distribution  to  a  standard  normal  variable  as  N  — ►  oo. 


Proof.     Recall  that 


No  +  N, 


in  the  discussion  before  Theorem  3.4.3,  and  apply  a  Slutsky-type  argument.  □ 

The  expressions  given  in  Corollary  3.4.2  are  most  convenient  for  calculating 
the  statistic  on  data.  If  the  support  of  the  survival  and  censoring  variables  is 
(  —  00,00),  then  S  =  00,  and  Condition  III  will  then  be  empty.  This  is  also  true  if 
no  censoring  occurs.  On  the  other  hand,  it  is  often  tenable  in  practice  to  assume 
uniformly  distributed  censoring  times  with  finite  support.  In  this  case  Condition 
III  is  required,  as  S  <  00. 

Use  of  the  Test 

Back  in  Section  2.2,  the  mixture  model  (2.2.4)  was  introduced  with  the  pre- 
sumption that  A  >  0,  i.e.,  that  larger  values  of  the  response  were  associated  with 
the  treatment  group  (See  Assumption  A2).  The  asymptotic  normal  distribution  of 
the  test  statistic  developed  in  Theorem  3.4.3  suggests  that  the  opposite  situation 
of  smaller  values  ( A  <  0)  can  be  handled  as  well,  as  long  as  the  required  conditions 
hold.  Note  that  the  special  case  of  Theorem  2.3.1  is  included  here. 

If  a  test  against  the  alternative 

G(x)  <  F(x),       with  strict  inequality  for  at  least  one  x, 
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is  desired,  the  rejection  region  will  be  in  the  lower  tail  of  the  standard  normal 
distribution.  For  A  >  0,  the  upper  tail  is  appropriate.  Absence  of  any  knowledge 
about  A  should  have  a  rejection  region  involving  both  tails. 

It  is  not  true,  however,  that  v  or  the  test  statistic  that  is  a  standardized  version 
of  v  is  an  odd  or  even  function  of  A.  The  fact  that  v  is  not  a  symmetric  function 
of  A  gives  another  aspect  in  the  consideration  of  the  choice  of  A  for  conducting  the 
test.  However, 

A: 

£(c,  +  mtC,)  =0  (3.4.13) 

holds,  since  the  sum  (3.4.13)  is  the  average  of  sums  of  scores  that  occur  from  all 
possible  underlying  rank  vectors  given  the  data  (Prentice,  1978).  Note  that  (3.4.13) 
implies 


k 


K  AC 

v  =  £  (*<«)<*  +  Mid)  =  -  E  (l1  -  z(i))c<  +  (m«  -  Mi)C)  .  (3.4.14) 


Hence  summation  of  the  scores  assigned  to  the  control  group  observations  (when 
A  >  0  is  employed)  produces  a  small  value  for  an  appropriately  standardized  test 
statistic  that  could  be  compared  with  the  lower  tail  of  the  standard  normal  distri- 
bution. 

If  larger  values  for  the  control  group  are  present,  the  assigned  scores  for  the 
control  group  will  likewise  tend  to  be  larger  than  those  scores  for  the  treatment 
group.  Then  v  (cf.  3.4.14)  based  on  the  treatment  group  scores  with  A  >  0  will 
tend  to  be  small,  and  the  test  statistic  in  Theorem  3.4.3  should  be  compared  with 
the  lower  tail  of  the  standard  normal  distribution.  Summing  the  negatives  of  the 
scores  in  (3.4.14)  would  produce  large  values  of  the  test  statistic  based  on  v,  and 
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the  rejection  region  should  then  be  based  on  the  upper  tail  of  the  standard  normal 
distribution.  This  approach  was  suggested  by  Conover  and  Salsburg  (1988)  for 
their  scores  derived  from  Lehmann  alternatives  and  uncensored  data.  Therefore,  if 
smaller  values  of  the  response  for  the  treatment  group  is  of  interest,  Theorem  3.4.3 
permits  us  to  either 

1.  Choose  a  value  of  A  <  0  to  calculate  the  scores,  and  compare  v  with  the  lower 
tail. 

2.  Take  the  negatives  of  the  scores  based  on  a  A  >  0,  and  compare  v  with  the 
upper  tail. 

3.  Take  the  scores  based  on  A  >  0  and  compare  with  the  lower  tail. 

Choices  (2)  and  (3)  will  be  equivalent  since  the  rejection  regions  are  based  on  the 
standard  normal  distribution. 

Dealing  with  Ties 

Those  who  are  practitioners  of  statistics  know  that  ties  in  the  observed  data  are 
often  prevalent.  Continuity  assumptions  on  the  survival  and  censoring  distributions 
imply  that  the  probability  of  ties  is  zero,  but  measurement  error  forces  us  to  group 
the  data  at  plausible  times.  Let  d\{$  denote  the  number  of  observed  failure  times 
in  the  treatment  group  at  time  x^,  and  let  rf(,-j  be  the  total  number  of  observed 
failure  times  at  £(,-).  The  test  statistic  v  as  in  (3.2.10)  can  be  modified  by  the  usual 
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convention  (e.g.  Prentice  and  Marek,  1979): 


i?, 


(3.4.15) 


The  stochastic  integral  form  (3.2.17)  already  covers  the  modification  as  does  the 
expression  (3.4.11)  for  the  limiting  variance.  Now  define 

&tfr(x)  =  K(x)  -  K(x-) 

to  be  the  number  of  failure  times  at  x  that  belong  to  treatment  group  r,  r  =  0,1. 
(Note  that  A  is  not  related  to  the  parameter  A.)  The  variance  estimators  are 
modified  as 


Vi  =  W     w\x) 

r=0J-°° 


Yl-r(x) 


[Y0(x)  +  Y1(x)\ 


\ ..  *£&£)  iNAA 


Yr(x)  -  1 


and 


V5 


/oo 
w2(x) 
■CO 


Y0(x)Y1(x) 
[(Y0(x)  +  Yi(x)Y 


1  ~       YoW  +  YrW-l      )  <W(*)+^(*))' 


It  is  also  mentioned  that  if  all  the  ties  at  a  particular  failure  time  belong  to  the  same 
group,  the  modifications  are  not  needed,  since  v  can  be  computed  with  a  breaking 
of  the  ties  in  some  arbitrary  order.  Ties  among  censoring  values  are  not  a  concern 
since  they  are  all  assigned  the  same  scoring  value  within  the  interval  defined  by 
adjacent  uncensored  values. 

By  the  same  arguments  in  establishing  (3.2.18)  for  u,  the  variance  expression  V2 
is  equal  to 
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with  0/0  =  0  and  W{  =  <?!>(£(,■))  —  $(£(,)).  When  there  are  no  ties  in  the  observed 
data,  (3.4.16)  reduces  to 

£«  (£)(>-£)■  <3^ 

Expression  (3.4.16)  has  often  been  interpreted  in  the  literature  as  a  weighted  sum 
of  hypergeometric  variances  because  of  the  argument  of  Mantel  (1966)  in  deriving 
the  log-rank  test  from  a  series  of  independent  2x2  contingency  tables. 

3.5     Evaluation  of  Expected  Scores 

The  forms  of  the  expected  scores  in  (3.2.4)  and  (3.2.5)  produce  very  involved 
multiple  integrations  when  some  of  the  m.j  are  nonzero  if  we  approach  the  evaluation 
in  a  brute-force  manner.  But  suppose  we  assume  that  </>((l  —  «,-))  =  1  in  (3.2.4). 
Then  the  integration  is  straightforward  (Prentice,  1978)  and  equals  the  value  of  1. 

This  suggests  that 

k 
g'(uu...,uk)  =  Y[RAl  -uiT3,     0<ux<  v<u*<l 

constitutes  a  joint  density  in  the  U{S.  Since  the  expected  scores  involve  score  func- 
tions that  only  depend  on  a  particular  «,-,  the  scores  c,  and  C,  can  be  represented 


as 


where 


c,  =   /    4>(ui)g*(ui)dui;  (3.5.1) 

<J  U 

C,=  I1  ${ui)g;  (ufidm,  (3.5.2) 

9'(ui)  =/•■■/  II  fyi1  ~  «i)mi*«i  •  •  •  dm-tdui+i  •■■duk  (3.5.3) 


"1 


<-<Ufc  J  =  1 
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is  the  marginal  density  in  u,  corresponding  to  the  joint  density  g*(v,i,. . .  ,ujt).  It 
turns  out  that  for  the  score  functions  considered  in  this  dissertation,  the  integrals 
in  (3.5.1)  and  (3.5.2)  do  not  have  nice  closed-form  expressions.  But  numerical 
evaluation  of  the  integrals  can  be  carried  out  in  a  straightforward  manner. 

The  Form  of  g'(ui) 

To  gain  some  insight,  consider  the  case  i  =  1.    Integration  over  uj, ...,Uk  in 
0*(ifi,...,Uft)  yields 


i 


Si>i)=    /   ■■■Jil{RJ(l-uJ)m>duJ}  =  Rl(l-ul)R>-1,         0<M1<1. 

To  obtain  g^u-i),  we  can  integrate  out  U3,  •  •  •  ,ujt  and  then  Ui: 

1  I  u}  k 


9l 


"*-l  "2     0 


U2 

0 
"2 

=  /^(l  -  u2)R2-1  /(l  -  u^-^-'dm 
0 

=  i?x/?2(l  -  u2)R-1  {^4^  [l  -  (1  -  «2)Hl-fl2] } 

=  RlR2  f(1-^)H2~1  +  (i-f«i)»-M  _ 


53*("3)=   /  •••/ 1 JiliRjil-usr'duj} 


For  z  =  3,  the  housekeeping  becomes  more  extensive.  We  get 

1      U3    U2      £ 

n 

ut_i  u3    0     0    J-1 

=  iZ1iZsA(s)(l  -us)**-1  J  J(l-u2)mw(l-u1)m(^duldu2 

0  0 

=  #li?2i?(3)(l  "  Ua)^"1 


U3   "2 
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o 
=  ftftft(l  -  u3)R>-1  J  (x^Tr)  (1  "  u^R2~R3~ldu> 

0 

«3 

-ftftft(i  -  ua)^"1  y  (R  _R)  (i  -  m)*-*-1* 


=  ftftft(l  -  w3)H3_1 

1 


(ft  —  ft)(ft  —  ft) 
-ftftft(l  -  u3)H3_1 
1 


{l -(!-«,)*-*} 


x  • 


(ft  —  ft)(ft  —  ft) 
=  ftftft(l  -  ua)^"1 

1 


{l -(!-«,)*-*} 


X 


1 


(ft  —  ft)(ft  —  ft)     (ft  —  ft)(ft  —  ft) 

1 


+ftftft(l  -  U3)*2-1 

+ftftft(l-u3)fll- 

=  ftftft(l-U3)H3_1 


(ft  —  ft)(ft  —  ft) 
1 


(ft  —  ft)(ft  —  ft) 

1 


+  ftftft(l-U3)fi2_1 


(ft  —  ft)(ft  —  ft) 

1 


(ft  —  ft)(ft  —  ft) 


+ftftft(l  -  u3) 


H,-l 


(ft  —  ft)(ft  —  ft) 


A  reasonable  conjecture  then  for  arbitrary  i  in  (3.5.3)  is 

<?.  (u«)  =      11  ft'     <L 

v*-1    / j=i  n  (ft  -  ft) 

1=1 


(3.5.4) 


The  validity  of  (3.5.4)  can  be  proven  by  mathematical  induction.  We  start  with  a 
result  that  will  be  used  in  the  proof. 
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LEMMA  3.5.1    The  risk  set  sizes 

Ri,      i  =  l,...,k  +  l     (%i=0), 

satisfy 

=  -EiTi '    «  =  !.•••  >*• 

U(Ri-Rt+i)        i=1  U(Ri-Rj) 

<*> 

Proof.  The  product  term  in  the  denominator  of  the  left  hand  side  can  be  regarded 
as  a  polynomial  in  i2,-+1  with  the  simple  roots  Ri,  R2, . . . ,  J2,-.  Then  the  whole 
expression  on  the  left  hand  side  is  a  ratio  of  two  rational  functions.  Since  the 
denominator  is  a  polynomial,  we  can  decompose  it  into  more  elementary  functions 
that  involve  the  factored  terms.  A  discussion  of  the  technique  can  be  found  in 
Gradshteyn  &  Rhyzik  (1980,  p.  56-57).  The  following  notations  of  /  and  <f>  mimics 
theirs  and  is  to  be  assumed  only  in  the  local  context  of  this  proof.  Denote 


f(R)  =  n(*-#); 

<f>(R)    =    1 


as  the  two  rational  functions.  Then 


HRi+i)  _         Ai         |         A2  Ai 


where 


Now 


f(Ri+i)      Ri+i  —  Ri     Ri+i  —  R2  Ri+i  —  /2i' 


^CRO         A      HM  ,      <l>(Ri) 

1    fW        2    fW  '        '~/W 


f(Rt+l)  =  (Ri+l  -  RtXRi+i  -R2)---  (Ri+1  -  Ri), 
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So 

f'(Rt+1)  =  fl(Ri+i  -  Ri)  +  ri(ifc+i  -  Ri)  +  •-.  +  11(12,+,  -  R,), 

which  implies  that 


(=1 
1*1 


1/2 


1=1 


/'(i?i)    =    11(^1  -12/), 


i=i 


/'(ZEa)     =     IJ(^  -Ri), 


1=1 

1/2 


as  all  other  terms  drop  out.  Since  <f>(Ri)  =  1,  /  =  1,  ••-,?',  we  have 


W-+i) 


1 


(Rn.i-Ri){jl(Ri-Rt) 

t*i 
+ 


l 


{Ri+1-Ri)(U(Ri-Ri) 

i=i 


("I) 


i-1 


1 


U(Ri-Ri) 
1=1 

iyi 


+  •••  + 


" 

1 

t'+l 

-Ri) 

l*i 

_ 

f— IV-1  f-i  '+1 


(-1)-1  ft 


i=i 

Hi 


Coupled  with  the  fact  that 


<KRi+i)       i 


/(ft+i)       ("I)*' 
we  get  the  desired  equality.  □ 


1 


U(Ri-Ri+i) 
n=i 
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As  a  first  step  in  the  induction  argument,  note  that  (3.5.4)  holds  for  i  =  1.  Next 
assume  that  (3.5.4)  holds  for  i.  From  this  assumption  it  must  be  proven  that  a 
version  of  (3.5.4)  with  i ;  +  1  replacing  i  everywhere  is  also  true. 

From  the  joint  density  in  the  ui, . . . ,  u*,  we  need  to  integrate  out  Uj, . . . ,  u,-  (call 
this  "subrange  1")  and  U(t+2), . . . ,  u*  ("subrange  2")  in  order  to  obtain  the  marginal 
density  <7*+1(u,+i).  Integration  over  subrange  2  yields 

Rl+l(l  -  «l+i)fi<+1-1  /•  •  •/  fl  H/(l  -  tii)Mi*ii.  (3.5.5) 

subrange  1  J 

To  see  what  happens  over  subrange  1,  write  (3.5.4)  as 


t-i 


^(l-u,)*-1  [  ]JRJ' 
U'=i 


£(l_-u,-)H'-*' 


i=i  n(^--ffi) 

1=1 


(3.5.6) 


Note  the  similarity  with  (3.5.5).  Because  we  are  assuming  the  case  for  i  is  true,  the 


expression 


«-i 


Vj'=i 


<   (1  -  «,)*'-* 

1=1 


in  (3.5.6)  is  the  value  of  the  integral  of  <7*(ui, . . . ,  Uk)  over  «j, 
quent  integration  of  (3.5.7)  over  ut-  produces 

ri  ^')  e  i — - —  /  (i  -  fif*-**-** 

*'->       /  >=»  n  (fl,  -  fl,)   o 

1=1 

Ml 
=  t  T-J (n-V)  [l  -  (1  "  «m)*'-*+']  • 

1=1 


(3.5.7) 


,u,_x.  The  subse- 


(3.5.8) 
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The  substitution  of  (3.5.8)  into  (3.5.5)  gives 


tf'-i         /  i-»     n  (#/  -  #;) 


(1  -  UH.!)^"**1  + 


(#«+!   -  #j) 


iij  —  iii+i 


/ 


-  ( ff  Ru>) 


\ 


2—i  i+i  2—i    t- 


1=1  (si 

<?0  I*; 


>=i  n(Ri-Rj)     n  (Ri-Ri+i) 

(=i  1=1 

iyj  i?<i+i 


/ 


(3.5.9) 


with  the  last  step  arising  from  substitution  of  the  equation  in  Lemma  3.5.1  to  get 
the  last  term  on  the  right  hand  side  of  (3.5.9).  Collecting  the  t '  +  1  terms  in  (3.5.9) 
into  one  summation  gives 

V-»       /;='   U(Ri-Rj) 

which  is  the  density  in  «,+1  that  matches  the  form  (3.5.4)  for  t/,-.  This  completes 
the  induction  argument  that  (3.5.4)  holds  for  all  i  >  1. 

Although  the  form  <7*(u,)  in  (3.5.4)  is  not  simple,  analytical  or  numerical  eval- 
uations of  c,  and  C,  will  involve  single  integrals. 

3.6     Properties  Under  Ha 


The  behavior  of  the  linear  rank  test  statistics  derived  in  previous  sections  can  be 
studied  in  terms  of  (Pitman)  asymptotic  relative  efficiency  (ARE).  The  structure  of 
local  alternatives  of  parameter  values  {ttn}  converging  to  the  null  hypothesis  value 
7T0  =  0  will  allow  us  to  compare  the  local  powers  of  v  for  different  scores. 
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Gill's  (1980)  foundational  work  will  again  be  applied.  In  particular,  his  Chapter 
Five  provides  sufficient  conditions  for  asymptotic  (TV  — »■  oo)  normality  for  stan- 
dardized v  under  a  sequence  of  local  alternatives  in  the  location-shift  model.  Gill's 
(1980)  results  can  be  used  to  evaluated  the  "efficacy"  of  the  linear  rank  test  statistic 
based  on  v  and  the  sequence  of  local  parameter  values.  The  Pitman  ARE  of  one 
test  with  a  competitor  can  be  established  by  evaluating  the  ratio  of  their  efficacies. 

The  efficacies  in  general  depend  on  the  censoring  distributions  for  the  two  sam- 
ples. Thus  we  can  study  the  influence  of  censoring  distributions  on  behavior  of  the 
tests.  For  the  exponential  distribution  (extreme- value  in  the  logarithmic  scale)  it  is 
known  that  fixed  (not  dependent  on  N)  and  equal  censoring  distributions  for  the 
control  and  treatment  populations  are  sufficient  for  the  log-rank  to  be  fully  efficient 
(Crowley  and  Thomas,  1975).  Gill  (1980)  extended  this  result  to  other  distribu- 
tional forms  including  the  logistic,  double-exponential,  and  normal  distributions. 

The  Structure  of  Local  Alternatives 

Let  {^n}  denote  a  sequence  of  parameters  belonging  to  (0,1]  where 

a 

■kn  = 


y/N 

for  some  constant  a  >  0.  This  choice  has  been  used  by  authors  in  the  uncensored 
data  case  (Johnson  et  al.,  1987).  When  convenient,  the  value  of  a  —  1  can  be 
assumed  without  loss  of  generality.  The  sequence  of  corresponding  control  and 
treatment  population  survival  functions  is 

FN(x)    =    F(x)         (unchanged  from  H0) 
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GN{x)    =    wNF{x:A)  +  {l-irN)F(x). 
Consistent  with  previous  definitions,  the  respective  cumulative  hazard  functions  are 
A^(x)  =  -log(F(x))    and    Af  (x)  =  -  \og(GN(x)). 

Sufficient  Conditions 

In  sections  3.3  and  3.4  of  this  manuscript,  Gill's  (1980;  p.  72)  conditions  sufficient 
for  a  limiting  null  distribution  were  stated  and  verified  for  v  in  the  mixed  model 
at  (2.2.4).  Under  a  sequence  of  local  alternative  parameter  values,  Gill's  (1980) 
conditions  on  his  page  106  in  addition  to  those  given  on  his  page  72  (Gill,  1980) 
will  have  to  be  verified  in  order  to  establish  asymptotic  properties. 

We  adopt  the  notational  convention  of  Gill  (1980)  with  suitable  modifications 
for  stating  these  conditions.  To  deal  with  local  alternatives,  certain  parts  of  the 
conditions  in  our  Section  3.3  must  be  replaced  with  more  general  statements.  The 
following  is  a  list  of  Gill's  (1980)  conditions  stated  in  terms  of  our  mixed  model 
setup  (2.2.4). 

I*.  There  exist  7r(x)  such  that 


^w-JaysiTfrW0"1)'    r=0'1- 

uniformly  on  each  closed  subinterval  of  {x  :   F{x)  >  0}.  Also,  let 

70*)  =  7o(*)  -  7i(*)- 

Let  k(x)  and 

K(x)  =  w{x)K*c{x)  =  w(x);Y°^Yl(<X) 


Y0(x)  +  Y1(x) 
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be  such  that 


lN°.*?1K(x)  =  k(x) 


n^oo  v  n0Ni 

uniformly  in  probability  on  closed  subintervals  of 

T  =  {x  |min(fo(a;)  >  0,.Fi(z)  >  0) }. 

The  function  k(x),  called  a  "limiting  weight  function"  by  Gill  (1980),  is  left  contin- 
uous with  right  hand  limits  such  that 


k+(x)  =  lim  k(t) 

t— »x+ 


is  of  bounded  variation  on  closed  subintervals  of  J*.  The  function  k(x)  is  defined 
to  be  zero  in  regions  outside  of  J*. 

a.  FN(x)  converges  uniformly  on  J*  to  F(x)  as  N  — ►  oo;  GN(x)  converges  uniformly 
on  J*  to  G(x)  as  iV  — ►  oo.  Also, 

Ar{x)  =   lim  A?(x) 

N—KXl 

is  finite  on  J*. 

b.  K2(x)/YT(x)  converges  uniformly  in  probability  to  hT(x)  as  N  — ►  oo  on  each 
closed  subinterval  of  J*,  where  hr(x)  is  left-continuous  and  has  right-hand  limits  on 
each  closed  subinterval  of  J*.  Also, 


lim  hr(t) 

t—*x+ 


must  be  of  bounded  variation  on  each  closed  subinterval  of  J* 


c.  YT(x)  — ►  oo  in  probability  as  N  — ♦  oo  for  each  x  G  I*. 


85 


II*.  Let  S  =  sup  I*.  If  S  i  I,  then  for  r  =  0, 1: 


a. 


/    hr(x)dAr(x) 
Ji' 


<  oo. 


limlimsupPr(  /      K2{t)/Yr(t)dA?(t)  >e)  =  0 

*T«S     N— oo  -'[r,5] 


for  every  e  >  0. 


III*.  If  S  <  oo,  then  for  r  =  0, 1: 


/        K'(x)/Yr(x)dA?(x) 

J(S,oq) 


converges  in  probability  to  0  as  N  — ♦  oo.   From  page  106  of  Gill  (1980)  the  extra 
conditions  are 

IV*.  If  Si  I",  then  for  r  =0,1: 


/    \k{x)fr(x)\dAo(x)  < 


oo 


imlimsupPr(/      \K(t)\  d\?(t)  -  dA0(t)\  >  e)  =  0 

'IS     N—oo  J[x,S]  I 


for  every  e  >  0. 
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V*.  If  S  <  oo,  then  for  r  =  0, 1: 


converges  in  probability  to  0  as  N  — ►  oo. 


/       \K(x)\  dA?(x)-dAo(*) 


Some  of  the  components  of  11*  and  III*  are  essentially  no  more  than  a  substitu- 
tion of  A^(x)  for  Ar(x)  in  conditions  II  and  III.  Also,  note  that 

bg(x)  =  A0(x)  =  A(x). 

In  order  to  verify  I*-  V*  for  the  mixed  model  setup,  we  follow  our  stategy  in  the 
null  hypothesis  situation,  where  each  condition  was  verified  in  two  steps: 

1.  Establish  that  such  conditions  have  been  verified  by  Gill  (1980)  for  the  case 
K(x)  =  Kq{x)  (log- rank);  and 

2.  Use  (1)  along  with  the  fact  that  our  weight  function  K{x)  can  be  expressed 
as  a  product  of  a  bounded  weight  function  w(x)  and  Kq(x). 

This  approach  will  be  referred  to  as  the  "two-point"  mechanism  in  the  remainder 
of  this  section. 

In  the  verification  of  I*— III*,  the  case  r  —  0  brings  us  back  to  the  conditions 
already  verified  for  the  null  hypothesis  case  of  Section  3.3.  Only  the  case  r  =  1 
remains.  We  start  with  condition  I*a.  The  functions  Gn(x),N  >  1,  and  G(x)  are 
continuous  survival  functions  and 


lim    Gn(x)  =  G(x):  lim  GWx)  =  G(x)   for  all   x. 

N->-  oo  V     '  V     '  W-oo  V     '  V     ' 
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Due  to  a  well-known  result  due  to  Polya  (see  e.g.  Serfling  (1980)),  the  uniform 
convergence  is  proved.  Condition  I*b  follows  just  as  condition  lb  did  because  K(x)  = 
w(x)Kc(x)  where  w(x)  is  bounded. 

To  evaluate  7(2)  and  other  expressions,  we  need 

ax 

9N(x) 

GN(x) 

*N(x)f(x  :  A)  +  (1  -  xN(x))f(x) 

wN(x)F(x  :  A)  +  (1  -  7rN{x))F(xY 

and 


A0"(x) 


dAo(x) 
dx 

F(x) 
=    A0(x). 


Then 


and 


d^(x) 
d\0(x) 


=  1, 


JAf(x)  Af(x) 


dA0(x) 


tf/ 


A0(x) 
7Tyv(x)/(x  :  A)  +  (1  -  TN(x))f(x) 


jN(x)F(x  :  A)  +  (1  -  irN(x))F{x) 


F(x) 


Therefore  7o(x)  =  0,  and  7(x)  =  —  71  (x).  Now, 


7i(x)  =   lim 

iv— .00 


/  aw    /^Af(x)     \ 

V  N0  +  Ni  \dA0(x)        ) 
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..        /     No 
=    hm 


i  V  NQ  +  Ni  v  V  <*Ao(*) 


n^oo  V  No  +  N 


Upon  substituting  7T/v  =  tw,  we  get 


f(x:A)  _  F(x:A) 


*M  =  J5SL  V  *  +  *  V  55+aT  U  ( W  "  0  +  *< 

//fa  :  A)       F(x:A)' 


=       VV'l/92aU,*(X) 

as  N  — *  oo,  where  w*(x)  was  denned  at  (3.4.6)  in  Theorem  3.4.2.  This  last  con- 
vergence is  uniform  on  each  closed  subinterval  of  {x  :  F(x)  >  0}  by  the  same 
reasoning  (cf.  (3.4.12),  two-point  mechanism)  employed  in  Theorem  3.4.3.  Likewise 
as  in  (3.4.12), 


uniformly  in  probability  on  closed  subintervals  of  J",  with  k(x)  being  left  continuous 
with  right  hand  limits  and  A;+(x)  of  bounded  variation,  again  since  w(x)  is  bounded 
and  the  version  of  these  properties  for  the  log-rank  statistic  have  been  verified  in 
Gill  (1980). 

Condition  II*a  is  the  same  as  condition  II  in  the  null  hypothesis  case  and  thus  is 
already  verified.  Verification  of  conditions  II*b  and  III*  again  makes  use  of  the  two- 
point  mechanism,  along  with  Lemma  4.3.2  of  Gill  (1980)  and  his  proof  of  Proposition 
4.3.3.  A  pair  of  inequalities 

K'c(x)  <Y0(x)Y1(x)      Nq  +  Nj  N0  +  Nx  Y0(x) 

Yx(x)    ~     N0      Nx    Y0(x)  +  Yx(x)-       Nx         N0 
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in  that  proof  are  used: 


J[x,s)  Yt(t)   dAo(t)      °v  ' 


dA(£)     since  A(x)  =  A0(x) 


*"(W-1)  +  1. 


(by  first  inequality) 

A^+  Nx  Y0{t)s 
No  j 


7TNw'(t) 


dA(t) 


M^-l)  +  l 


+  1 


rfA(<) 


(by  second  inequality) 

iVo  +  Ni  Yo(t) 
No 


=  1  -W— 


1 

a  •  iy 

'(<) 

yNo  +  Ni 

Mt 

-1)  +  1. 

<*A(*) 


< 


a  I      MJtktJLM 
J[r,S]         \       Ni  N0    . 


«*iW-i)+i 


<*A(<) 


+ 


'No±NiY0(t) 


I    m2  i '"" '  """V 1  dhti) 

J[x,s]       \     Ni        N0    ' 


(where  0  <  Mx  <  oo  and  0  <  M2  <  oo  since  £(*)  and  w'(t)  are  bounded) 


<aMi  I      ( 

J[X,S]  \ 

+M2  ( 


Ni        N0  . 
Np  +  Nj  Y0(t) 
,s]  V      Nx        N0  , 

(since  ^i  -  1  >  0  for  alii) 


rfA(i) 
rfA(<) 


=  (aM1+M,)/      (4  +  fiM)^, 
'Ml  V      AT,        N0  J       K  ' 

-^  (aJl/,  +  M2)—  /      FQ(t)d\(t) 


0   as   a;  |  <S. 
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Thus  condition  II*b  holds.  The  same  series  of  inequalities  gives  Condition  III  since 


{aMx  +  M2)—  (        F0(x)dA(x)    =    0 

Pi  J(S,oo) 


holds  on  account  of  Fo(x)  =  Fi (x)  =  0  for  all  x  >  S. 

As  for  the  remaining  conditions,  let  us  begin  with  condition  IV*a.   By  similar 
arguments  to  those  used  in  the  proof  of  Theorem  3.4.3, 


k(x)  =  u>j(x) 


F0{x)Fi(x) 


po(x)FQ(x)  +  pi(x)Fi(x) 


where  wl(x)  is  of  the  same  type  but  does  not  necessarily  equal  the  w*(x)  function 
employed  in  calculating  7(x).  That  is, 


w{{x)  = 


/(x:A»)      F(x :  AQ 


/(*) 


Ft*)     ' 


where  Ai  does  not  necessarily  equal  A.  Condition  IV'a  then  becomes 


7(o 


CO) 


Wi(x)w*(x)- 


Fo(x)A(x) 


dA(x)  <  oo, 


pd(x)F0(x)  +  p1(x)Fi(x) 

and  is  equal  to  the  integral  in  equation  (3.4.11)  for  Var(i/*')  when  w*(x)  =  wfa). 
The  finiteness  of  this  last  expression  holds  by  the  two-point  mechanism  when  w*(x) 
and  wl(x)  are  bounded  on  I*. 

Conditions  IV'b  and  V*  follow  by  again  employing  our  two-point  mechanism  in 
combination  with  the  two  inequalities  (proof  of  Proposition  4.3.3)  from  Gill  (1980). 


/      \K(t)\\dA?(t)-dA(t)\ 

J[x,S] 

=  [     HOI 

J[x,S] 

=  [\m\ 

J[x,S] 


yv0yvx  ro(t)Ki(0    n0  +  n, 


ViVo  +  iVi   7V0     Ni    Y0(t)  +  Yi(t) 
YQ(t)Yx{t)     No  +  Nr 


dA?(t) 


dA(t) 


-1 


dA(t) 


No     Nx    Y0(t)  +  Yx(t) 
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No 


Ni 


j  N0 + m 


< 


aM  I 

J[x,S] 


No  +  Ni  V  ^o  +  M 
Yo(t)Yi(t)     N0  +  Nl 


irNw'(t) 


TN(£ffl-l)+l 


d\(t) 


No     Ni   Y0(t)  +  Yx(t) 


No 


iV, 


1 


MW-0+1 


< 


< 


V  No  +  Nt  V  No  +  AT, 
(where  0  <  M  <  oo  bounds  u>(a;)  and  iw*(or)) 

y0(OW)    Wo  +  M 


dA(t) 


./fa 


iVo     Nt   Y0(t)  +  Yi{t) 
N0  +  iVx  r0(0 


rfA(<) 


[*,£]         iVi  A0 


■d\{t) 


M—  [      F0(t)dA(t) 

Pi  J[x,S] 

0   as   x  1  S. 


Application  of  this  last  result  in  the  same  way  as  that  for  Conditions  II*b  and  III* 
finishes  the  verification. 

Hence  the  asymptotic  properties  of  Gill  (1980)  under  local  alternatives  indeed 
apply  to  linear  rank  test  statistics  with  bounded  w(x)  for  the  testing  situation  about 
T  in  our  mixture  model  at  (2.2.4). 


Efficacy  Formula 

Under  the  sequence  of  alternatives  {ttn},  the  efficacy  of  the  linear  rank  statistic 
v  appropriately  standardized  by  V\  V-i,  or  Var(i/*)  is 


e(k(x)) 


J  k(x)~f(x)dA(x) 
o 


(3.6.1) 


This  definition  (see  Gill  (1980)  for  more  details)  is  general  enough  to  cover  other 
test  statistics  that  are  distinguished  by  some  choice  of  the  limiting  weight  function 
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k(x).  For  instance,  if 

k(x)  =  ke(x)  =  .  °  *     *  *     .        , 

/Jolxj^oia;)  +  pi[x)Fi(x) 

then  substitution  into  (3.6.1)  yields  an  efficacy  expression  for  the  log-rank  statistic 

(Gill,  1980). 

The  asymptotic  relative  efficiency  of  two  statistics  corresponding  to  ki(x)  and 

£2(2)  will  be  the  ratio  of  their  corresponding  efficacies 

e«(kx(x),k2(x))  =  e(fc1(ar))/e(jfe3(a:)). 

More  precisely,  this  formula  computes  the  efficiency  of  a  test  statistic  configured 
by  ki(x)  with  respect  to  one  configured  by  Ar2(a:)  under  the  sequence  of  alternatives 

A  test  statistic  that  results  in  the  maximum  possible  value  for  (3.6.1)  in  some 
class  for  the  given  testing  problem  is  desirable.  This  maximum  efficiency  will  exist 
for  a  statistic  that  results  in  the  largest  efficacy  value  (3.6.1).  Gill  (1980,  pp.  107- 
108)  has  shown  that  within  the  class  K,  (cf.  Section  3.3),  a  test  statistic  formed  by 
a  member  K(x)  that  has 

K.)«  ff('»  ,^,)  (3.6.2) 

po(x)F0{x)  +  pi(x)Fi{x) 

as  its  limiting  weight  function  will  maximize  (3.6.1).  The  aspect  of  proportionality 
comes  from  the  potential  dependence  of  the  efficacy  on  the  censoring  distributions 
(Gill,  1980).  It  can  be  observed  that  (3.6.2)  is  satisfied  by  a  k(x)  with  a  true  value 
of  A,  and  the  function  7(x),  that  are  both  presented  on  page  88  of  this  manuscript. 
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So  in  the  presence  of  censoring,  we  are  enabled  to  explore  the  behavior  of  v 
against  competitors  from  the  class  /C  under  a  sequence  of  local  alternatives  {tat}. 
Possibilities  for  the  competitors  are  test  statistics  with  different  values  for  A  in  the 
score  functions,  or  the  location-shift  model  test  statistics. 

Efficiency  Expressions 

The  value  of  7(2)  depends  on  a  chosen  value  for  A.  A  desirable  choice  would 
be  the  true  value  for  A,  denoted  by  A*,  of  course.  Define  wt{x)  as  the  weight 
function  that  has  this  true  value  and  in  turn  determines  f(x).  Since  the  true  value 
is  unknown  in  practice,  let  wch(x)  be  the  weight  function  based  on  the  chosen  value 
of  A  (Ac/J  for  the  test  statistic  v.  Correspondingly,  let 


w[(x)    =      lim  wt(x) 

N— >oo 
W"ch(X)      =      Jim    V>ck(x) 


be  the  limiting  weight  functions  of  wt{x)  and  wch(x).  If  we  wish  to  compare  the 
efficiency  of  u  based  on  a  chosen  value  for  A  with  respect  to  the  ideal  choice  of  the 
true  value  for  A,  let  kck(x)  be  the  limiting  weight  function  corresponding  to  the  v 
that  is  used  for  conducting  the  test,  and  let  kt(x)  be  the  limiting  weight  function 
for  the  ideal  test  based  on  the  true  value  of  A.  Also  suppose  that  this  indexing 
is  used  on  the  corresponding  score  functions  <f>(u)  and  $(u),  and  additionally  the 
density  f(x  :  A)  and  survival  function  F(x  :  A).  Note  the  similarity  in  structure  to 
the  uncensored  data  ARE  investigations  of  Section  2.3. 

In  the  situation  of  studying  the  behavior  of  a  test  statistic  v  in  the  possible 
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presence  of  censored  data  when  A  is  misspecified,  the  efficacy  of  the  linear  rank 


statistic  v  is 


/  kch(x)f{x)dA(x) 


j  ^^l^f^)klh{x)dA(x) 


Vp^\7  km  L£$+yi(x))  «*(*m«) 


J  F0(x)Fi(x)        V     cflA    J  p0Fo(x)+piFi(x)/  v    > 


Popld 


fch(*:*)  _  Fch(x:A) 


/(*) 


/«(«:&)         H,(x) 


F(x)     J    I    /(x)  F(x)  J 


/tf  9/  WA(*) 


\fch(x:A)         Fch(x:A)]2 

}(x)  F{x)     J     p0F0(x)+p1F1(x)' 


&$2ftMu(*) 


Since  under  the  null  hypothesis  ir  =  0, 


F0(x)  =  F(z)L0(x);         ft(x)  =  F(x)L1(x), 


the  efficacy  of  v  based  on  wch(x)  becomes 


poPia 


I 


Ich(x:A)  Fch{x:A) 


ft(x:A)   _     F,(x:A) 
f(x)  F(x) 


F(x) 


L0(x)L1(x) 


PoLo{x)+p1Li(x) 


dA(x) 


'    \fch(x:A)  _  Fch(x:AV2 
.     /(*)  F(x) 


/ 


F(x) 


L0(x)L\(x) 


,p0L0(x)+p1Li(x) 


dA{x) 


(3.6.3) 
Expression  (3.6.3)  can  also  to  be  used  to  compute  efficiencies  of  the  standard 
test  statistics  (log-rank,  etc.)  based  on  the  location-shift  model  in  relation  to  the 
mixed-model  based  test  statistics.  As  done  in  the  example  of  Section  2.3,  denote 
the  score  functions  in  the  standard  tests  by  4>ch{u)  and  $c^(u),  and  let  the  mixed- 
model  score  functions  be  given  by  4>t{u)  and  $t(u).  The  resulting  calculation  is  the 
Pitman  asymptotic  relative  efficiency  for  the  location-shift  based  statistic  relative 
to  the  mixed-model  statistic  when  the  underlying  distribution  is  represented  by  the 
mixture  model  at  (2.2.4). 
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Next  let  us  consider  three  scenarios  for  the  censorship  distributions  correspond- 
ing to  Lo(x)  and  L\{x)  and  see  how  the  efficacy  expression  (3.6.3)  turns  out. 

I.  There  is  no  censoring. 

The  relation  Lq(x)  =  L\(x)  =  1  holds  here,  which  means  that  F0(x)  =  F\(x)  = 
F(x).  Upon  the  substitution 

t  =  -\og(F(x))  =  \(x), 


(3.6.3)  reduces  to 


PoPia' 


I 


JcHJF-He-'y.A)       Feh(F-»(e-'):A) 
/(F">(e-«))  F(F-l(e-)) 


r/,(F-'(e-):A)  _  F.(F-'(e-):A)  1     -t  rf, 
.    /(F-»(e-«))  F(F-'(e-'))    J 


/ 


/Ch(F-'(e-'):A)       Feh(F-'(e-'):A) 


i3 


/(F"i(e-«))  F(F->(e-)) 

and  the  second  substitution  1  —  u  =  e_<  yields 


e-*dt 


1  2 


F0/?ia2  /  [<MU)  ~  $ch(u)\  [&(«)  -  #t(«)]  <*U 

0 


(3.6.4) 


J  [<f>ch{u)  -  $ch{u)}2  du 

0 

It  would  be  comforting  to  know  that  the  denominator  of  (3.6.4)  reduces  to  a  form 
that  is  compatible  with  the  expression  displayed  in  Theorem  2.3.2 — the  variance  of 
v  for  uncensored  data.  The  next  lemma  helps  out. 

LEMMA  3.6.1    Under  the  mixed-model  setup  at  (2.2.4),  the  relation 

l  i  l 

J  $ch(u)$t(u)du  -  J  $t(u)<j>ch(u)du  -  J  $ch(u)<f>t(u)du  =  0 


holds. 
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Proof.    Recall  that 


/  (f>(u)du  = 


from  Lemma  2.3.1.  Combined  with  Lemma  3.3.1,  this  implies  the  relation 


U  1 

/  (j>(v)dv  =  -  I  <t>(v)dv  =  -(1  -  u)$(u). 


An  additionally  helpful  relation  is 


9'(u) 


du 


V 

=  -L-  [*(«)  -  *(„)] . 

1  —  u 


1       1 


1-ti 


[*((u)-^cA(u)]du 


Now  integration-by-parts  yields 

i 

u 

$t{u)  J  <t>ck(v)dv      -f\j   4>ch{v)dv 

0  J0       o 

1 

=  [-9«(u)(l  -  «)*efc(u)]J  -  /  [-(1  -  u^ti)]  -i_9«(ti)</u 

^  1  —  u 

0 

+  /  [-(1  -  u)9c/l(u)] ^(u)rfti 

J  1  —  u 

0 

1  1 

=  $«(0)$cfc(0)  +  y  9cfc(u)9«(u)<fu  -  /  9cfc(u)&(u)<fo 

0  0 

=  J  9cfc(u)9(-(u)du  -  J  $ch{u)(f>t{u)du      I  =  J  $t{u)<j>ch(u) )  .        (3.6.5) 

0  0  \        0  / 

The  step  to  the  last  line  (3.6.5)  is  aided  by  the  fact  that 

9(0)  slim  *(«)  =    lim    f^/^-l)  =7-1  =  0. 

«-o  i— oo  y   f(x)        y      i 


Rearrangement  of  terms  at  (3.6.5)  produces  the  relation.  □ 
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Lemma  3.6.1  tells  us  that 

i 


o 

This  informs  us  that 

i 


/  [<f>ch{v)  -  $ch(u)}  [<f>t(u)  -  $t(u)}  du 

0 

1  1 

=  J  <t>ch{u)(f>t{u)du  -  j  $t{u)4>ch{u)du 

0  0 

1  1 

-  j  $ch(u)<f>t(u)du  +  j  $ch{u)$t(u)du 

0  0 

1 
=  j  <f>ch(u)(j>t(u)du.  (366) 


J  [<t>{u)  -  $(U)]2  =  J  4>2{u)du,  (3.6.7) 

o  0 

which  matches  the  integral  in  the  variance  expression  of  Theorem  2.3.1.   Because 

the  efficacy  (by  (3.6.6)  k  (3.6.7))  when  <f>ch(u)  =  <j>t(u)  is 

i 

/  <t>2t{u)du, 
o 

the  corresponding  efficiency  for  misspecified  A  when  there  is  no  censoring  matches 
the  formula  due  to  Johnson  et  al.  (1987)  stated  in  Theorem  2.3.2. 

II-  The  censoring  distributions  are  equal. 

Let  L{x)  be  the  common  function  for  L0(x)  and  l*(x).  Then  we  have 

F0(x)  =  Ft(x)  =  F(x)L(x). 

Putting  these  into  (3.6.3)  along  with  the  two  integral  substitutions  in  the  first 

scenario  leads  to 

"  l 

f[<f>ch(u)  -  *ck(u)]  [<fn(u)  -  •1(ti)]Z(F-«(l  -  u))du 


PoPio? 


f  [4>ck{u)  -  <bch{u)\2  L(F-*(1  -  u))du 


(3.6.8) 
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The  resulting  efficiency  for  misspecified  A  is 


/  [<M")  -  *c*(ti)]  [<M")  -  $t(u)]  L{F-\\  -  u))du 

0 


/  [**(«)  -  ^(u)]2  L(F-»(1  -  u))«fci  /  [&(«)  -  $,(«)]2 1(F-i(l  -  u))rfu 

0  0 


(3.6.9) 


This  is  an  important  case  since  the  assumption  of  equal  censoring  distributions  for 
the  control  and  treatment  groups  is  often  done  in  practice. 


III.  The  censoring  distributions  are  not  equal. 

The  same  techniques  as  those  used  in  the  first  two  scenarios  yields 


/[M«)-^(u)r 

0 


L0(F-'(l-u))Li(F-»(l-u)) 
PoL0(F-i(l-u))+PiI.i(F-i(l-«))J 


du 


(3.6.10) 


for  the  efficacy  (3.6.3)  and 


/  IMu)  -  **(«)]  [Mu)  -  *,(«)]  [^^-.(^^(Aff^l  rf« 


LofF-'d-ttjjLiCF-'fl-u)) 


Var(^)Var(I/r) 


(3.6.11) 


for  the  efficiency  of  interest,  with 

i 


[p0Lo(F-i(l  -  u))  +  pMF-^1  -  u))\ 


du 
(3.6.12) 


and  Var(z/t")  is  defined  in  the  same  way.    Note  that  these  "variance"  expressions 
match  up  with  (3.4.11). 


CHAPTER  4 
FORMS  OF  THE  TEST  STATISTIC 


4.1     Introduction 


The  methods  developed  in  Chapter  Three  will  now  be  applied  to  to  the  mix- 
ture model  assuming  some  of  the  more  common  distributional  forms  for  F(x  :  A). 
Types  of  distributional  forms  include  the  normal,  logistic,  extreme-value,  double- 
exponential,  and  Cauchy. 

The  generation  of  specific  score  functions  makes  up  Section  Two  of  this  chapter. 
Section  Three  focuses  on  the  evaluation  of  the  expected  scores  for  uncensored  data. 
Conditions  that  were  presented  in  Theorem  2.3.1  as  sufficient  for  an  asymptotic 
normal  distribution  of  the  test  statistic  based  on  the  expected  scores  or  the  ap- 
proximate scores  are  also  considered.  Section  Four  considers  score  systems  for  the 
censored  data  case.  The  conditions  in  Theorem  3.3.1  that  are  sufficient  for  the  use 
of  the  approximate  scores  in  place  of  the  expected  scores  in  the  censored  data  test 
statistic  v  are  investigated. 
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4.2     Score  Functions  for  Inference  about  it 


Recall  from  Theorem  3.2.1  that  the  uncensored  and  censored  score  functions  for 
testing  H0  :  x  =  0  are  defined  as 


4>(u)  = 


f(F-l(l-u):A) 
/(F-»(l-u)) 


-1 


and 


$(«)  = 


F(F-X(1-«):A) 


-1 


F(F-l(l-u)) 

respectively.  These  score  functions  will  depend  on  the  parameter  A  in  the  model 
(2.2.4).  In  the  following  we  exhibit  the  forms  of  <f>  and  $  corresponding  to  some 
selected  location-shift  type  survival  function 


F(x  :  A)  =  F(x  -  A). 


The  inverse  function  of  the  survival  function  F  when  A  =  0  is 


F-1(l-u)  =  F-1(l-u:0) 


We  now  consider  some  specific  forms  for  F. 


Normal  score  functions 


Let 


^.:A)-/^-.-«^*. 


The  uncensored  score  function  is 


4>MN(U)  =  exp  (A  (f-^I  —  «>  — 2"))  —  1- 
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Johnson  et  al.  (1987)  considered  the  resulting  approximate  normal  scores,  which 
arise  from  the  substitution 


u  = 


N  +  l 


at  the  observed  point  x^y  Since  F(x  —  A)  does  not  have  a  closed  form,  the  censored 
score  function  can  only  be  represented  as  $(u)  is  above: 

$mn(«   =  — \ ■ '-  -  1. 

1  —  u 

Logistic  score  functions 


Here  we  have 


F(x  :  A)  = 


e-(x-A) 


1  +  c-(*-A) ' 
whereupon  the  uncensored  score  function  is  given  by 

i  +  e-F-m-u) 


<f>Mt{u)  =  e 


1  +  c-[F->(l-u)-A] 


-1. 


Use  of  the  equation 


exp(-^-1(l  -«))  = 


1  -u 


u 


provides  the  simplification 


<£ml(")  =  ez 


eA  +  u(l  -eA) 


-1. 


Similarly,  the  censored  score  function  turns  out  to  be 


$ml(u)  =  zL 


eA+u(l  -eA) 


-1. 


Double-Exponential  score  functions 


For  this  distribution, 


F(x  :  A)  =  < 


(ie-<*-A),         if  x  >  0; 


l_Ie(»-A)j     if  a-  <  o. 

Therefore,  the  required  uncensored  score  function  is 


4>mde(u)  =  ' 


1, 


if  F_1(l  -«)<0; 


;2F-»(i-u)-a  _  1;    ifo<F"1(l-u)<  A; 


UA-1, 
With  the  use  of  the  inverse  function 

flog[2u], 


if  F~1(l-u)>  A. 


F~1(l-u)  =  { 

this  can  be  re-expressed  as 


if  F-^l-u)  <  0; 
-log[2(l-u)],    ifF"1(l-u)>0, 


4>mde{u)  = 


e-A-l,         if0<u<±; 


1,     if  |<u<  l-ie"A; 


4(l-u)2 


I  ea  - 1, 

The  censored  score  function  is  expressed  as 


if  1  -  ±e"A  <  u  <  1. 


(  u(l-e-A) 
1-u, 


$mde(")  =  < 


if  0  <  u<  i; 


-* £l^—      if  -  <  u  <  1  -  ie"A- 


eA-L 


if  1  -  ±e"A  <  u  <  1. 


Cauchy  score  functions 
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We  have 


F(x  :  A)  = arctan(x  -  A), 

2         IT 


so  that  the  uncensored  score  function  is 


<^MC(«) 


A{2F~1(1  -  u)  -  A) 
1  +  (F-!(l  -  u)  -  A)5 
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The  inverse  function  is 


F-1(l-u)  =  tan(7r(u-i)), 


so  that  the  uncensored  score  function  also  equals 

1  +tan  [*r(u-  \)Y 

0mc(")  = : 

1  +   tan 

The  censored  score  function  is 


*(«  "  J)]  -  A 


$MC(t/)  =    2 ZE 1 V 1 )J_ 

1   —  U 


Extreme- Value  score  functions 

Unlike  the  other  density  forms  we  examined  thus  far,  the  extreme- value  form  is 
not  a  symmetric  density  function.  There  are  two  types  of  extreme-value  densities 
we  consider  here.  One  type  is 

f(x  :  A)  =  exp(x  -  A  -  e(l"A)).  (4.2.1) 

The  other  density  form  is 

f(x  :  A)  =  exp(-(i  -  A)  -  e"(l-A)).  (4.2.2) 

Note  that  these  are  just  the  distributions  of  random  variables  X  —  A  and  —  (X  —  A), 
where  X  has  the  density  f(x  :  0)  given  by  (4.2.1).  The  density  f(x  :  0)  at  (4.2.1) 
corresponds  to  the  random  variable  X  =  log  Y  where  Y  has  the  exponential  density 
form. 

For  the  density  form  (4.2.1),  the  inverse  function  is 

F-1(l-u)  =  log(-log(l-u)), 
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which  leads  to  the  uncensored  score  function 

and  the  censored  score  function 

*mev(«) = a  -  «rA_i  - 1. 

There  are  some  problems  with  the  asymptotic  properties  of  the  test  statistic  gen- 
erated by  the  score  function  corresponding  to  the  density  (4.2.1).  Specifically,  the 
integral  component  of  the  asymptotic  variance  in  Theorem  2.3.1  of  the  score  func- 
tion is  infinite  for  values  of  A  >  log  2,  thereby  violating  the  condition  (ii)  of  Theorem 
2.3.1.  Also,  the  existence  of  the  censored  score  function  is  unclear  because  condition 
(iii)  of  Theorem  3.2.1,  which  permits  exchange  of  differentiation  and  integration, 
fails  for  A  >  log  2.  For  the  mixture  model  (2.2.4),  the  asymptotic  properties  of  the 
test  statistic  u  based  on  such  score  functions  cannot  be  established  by  the  methods 
in  this  manuscript. 

Now  suppose  the  density  for  the  responders  is  as  given  at  (4.2.2): 

f(x  :  A)  =  exp(-(x  -  A)  -  e"(x-A)). 

The  uncensored  score  function  is 

<f>MEv{u)  =  exp  (A  -  e-F"'(1-u)(eA  -  1))  -  1, 

which  can  be  somewhat  simplified  by  substitution  of  the  formula 

F-1(l-u)  =  -log(-log(«)) 
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to  get 

<I>mev(u)  =  eA(u)eA-1  -  1. 

The  censored  score  function  is  found  to  be 

*>        i    \       1  - «  ,       u-ue 

1  —  tl  1  —  U 

Conover  and  Salsburg  (1988)  came  up  with  the  same  form  as  ^mev(w)  for  <f>(u) 
from  (4.2.1)  using  a  Lehmann  alternative  for  the  nonresponders  (see  our  Section 
2.2);  their  parameter  a  corresponds  to  our  eA.  Parallel  to  the  score  functions  from 
0mev(u)  and  ^mev(u)  fr°m  (4-2.1),  problems  occur  for  values  of  A  <  -log 2  = 
log  0.5  for  the  score  functions  ^mev(^)  and  $mev(i0- 

Now  consider  F(x  :  A)  to  be  of  the  Lehmann-type.  Under  the  Lehmann- 
alternative  model  (iii)  in  Section  2.2,  the  generated  score  functions  are  equivalent  to 
those  in  the  extreme-value  case  represented  by  <^mEV(")  and  $mEv(u)  fr°m  (4-2.1). 
Recall  that  these  last  score  functions  were  rejected  due  to  large-sample  behavior 
considerations.  Conover  and  Salsburg  (1988)  recommended  that  such  score  func- 
tions (derived  from  their  Model  2)  not  be  used  because  "it  appears  to  have  low 
power  in  the  examples  we  investigated."  Presumably,  the  authors  refer  to  the  em- 
pirical power  in  the  Monte  Carlo  study  they  ran  and/or  the  results  from  the  real 
data  sets  where  linear  rank  tests  were  applied  in  their  paper.  Unfortunately,  the 
paper  does  not  give  the  figures  from  the  Monte  Carlo  study.  There  was  no  mention 
of  an  infinite  variance  problem  for  certain  values  of  a,  either. 

The  Lehmann-alternative  model  corresponding  to  (iv)  in  Section  2.2  yields  the 
same  score  functions  <^>mev(u)  and  $mev(u)  derived  from  the  extreme-value  form 
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(4.2.2).  These  correspond  to  Conover  and  Salsburg's  (1988)  score  functions  arising 
from  their  Model  1. 

Therefore  in  subsequent  investigations,  the  Lehmann  model  (iv)  will  implicitly 
be  considered  under  the  extreme-value  label  that  corresponds  to  the  score  func- 
tions <Pmev(u)  and  $mev(u)  derived  from  the  density  form  (4.2.2).  The  Lehmann- 
alternative  model  (iii)  in  Section  2.2  and  the  score  functions  ^mEV(")  and  $mev(u) 
from  (4.2.1)  will  not  be  considered  at  all  since  subsequent  investigations  in  this 
manuscript  such  as  efficiencies  and  simulations  assume  that  A  >  0. 

4.3     Uncensored  Data  Expected  Scores 

In  the  literature  review  of  Section  2.3,  the  normal  distribution  uncensored  score 
function  <£>mn(")  as  reported  by  Johnson  et  al.  (1987)  was  considered.  Recall  that 
expected  scores  can  be  evaluated  by  taking  the  expectation  of  <f>(U)  where  U  is 
assumed  to  have  the  beta  density  with  parameters  i  and  N  +  1  —  i.  Since  <^mn(w) 
does  not  have  a  closed  form,  the  expected  score  c,  must  be  numerically  evaluated. 
In  the  following,  expected  scores  for  other  distributions  are  reported. 

Theorem  2.3.1  states  sufficient  conditions  for  linear  rank  tests  based  on  <j>(u) 
to  have  asymptotic  normality  properties.  Condition  (i)  stated  that  <f>(u)  must  be 
monotonic;  while  Condition  (ii)  of  Theorem  2.3.1  stipulated  that 


/  <f>2{u)du 


<  oo. 


The  existence  of  these  two  conditions  are  also  discussed  in  the  remainder  of  this 
section. 
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Logistic  expected  scores 

Obtaining  an  analytic  expression  for  the  expected  scores  requires  some  algebra 
for  the  logistic  distribution.  The  uncensored  expected  score  c,  satisfies 


(i-  l)\{N-i)\ 

m 


^/^(l-u,)^1-)-1 


1 


0 


eA  +  u,(l  -eA) 
dui 


dui 


J2  (N  7  *\ '-1  &-*-'-*-*-> 


1 


1  2 


■(iVEf'VW-'-' 


j=0 


1 

/     AT-i-1 


c?ut- 


1  +  P#)  ".J 


The  bracketed  expression  in  the  last  line  of  the  equation  can  be  represented 
as  a  quantity  involving  a  hypergeometric  function.  Gradshteyn  and  Ryzhik  (1980) 
provide  a  complete  detailed  reference  to  the  properties  of  hypergeometric  functions. 

Using  the  integral  representation  given  on  page  286  of  Gradshetyn  and  Rhyzik 
(1980), 

(i-l)!(JV-i)! 


■Ci  = 


TV! 
^)2Eq{N  jy^f-^BiN  -JA)F(2,N  -y,N  -j  +  1;1  -e^), 


where 


B(aJ)  = 


r(q)r(/?) 


T  stands  for  the  Gamma  function,  and 


F(a,0;T,z)  = 


aJ3_        a(a  +  1)0(0  +  1)   2      aja  +  l)(a  +  2)0(0  +  1)(0  +  2)  , 
7-1"         7(7  +  l)-l-2    Z    +         7(7  +  l)(7  +  2)-l-2-3        *    + 
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Straightforward  substitution  into  the  above  expression  shows  that 


so  that 


l^T 


N\ 


eAJ    (i-l)\(N-i)l 
x  £  (N  ~  ')  (-if-^BiN  -  j,  1)F(2,  N  -j;N  -j  + 1, 1  -  e"A)  -  1 

j=0   \     J      J 


E  t#tJW  +  i)(i  -  «~Y 


- 1 


is  the  expected  score  corresponding  to  X(,j,  i  =  1, . . . ,  N  for  a  logistic  distribution. 
For  the  conditions  of  Theorem  2.3.1,  we  have 


«V(«)  =  eA(2)(eA-l)[eA  +  u(l-eA)]- 


which  is  positive  valued  for  all  u  £  (0, 1),  so  the  monotonicity  condition  (i)  holds. 

l 
The  expression  for  /  ^L(u)du  is 
o 

(eA  ~  I)2 
3eA 

when  A  >  0,  and  is  finite.  Hence  condition  (ii)  of  Theorem  2.3.1  is  satisfied. 


Double- Exponential  expected  scores 


Evaluation  over  the  three  regions  of  the  score  function  gives 
f  e~A  -  1,  if  0  <«,•<£: 


c,  =  < 


2> 


N{N-1)       ez*  _  1        if  I<u.<1_Ie-A. 
-i)(/V-l-i)     4  x'       "j^"'-1         2e        ' 


{N-i)(N-\-i) 


{  eA  -  1,  if  1  -  \e~A  <  u,  <  1. 
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Checking  the  monotonicity  condition  (i)  in  Theorem  2.3.1  yields 

0,  if0<u<l/2; 

*mde(«)  =  pr(!  "  ")~31    if  1/2  <  «  <  1  -  l/2(c-A); 

.0,  if  1 -l/2(e"A)  <u<  1. 

Thus  condition  (i)  is  verified.  For  this  distribution, 

i 

|^DE(u)(/u  =  ic-aA  +  |eA-l, 

o 

when  A  >  0.  Hence  condition  (ii)  of  Theorem  2.3.1  is  also  verified. 

Cauchy  expected  scores 

The  expected  score  requires  numerical  evaluation.  The  integration  expression 
written  in  the  normal  case  above  can  be  employed. 

The  uncensored  score  function  <f>Mc(u)  is  not  strictly  monotonic,  so  Theorem 
2.3.1  cannot  be  applied.  Since  the  condition  (i)  is  stated  as  a  sufficient  condition, 
perhaps  an  alternative  condition  allows  the  same  results  for  the  Cauchy  distribution. 
Indeed,  the  theory  developed  in  Chapter  Three  for  potentially  censored  data  can 
be  applied  (cf.  Section  4.4)  to  achieve  the  results  of  Theorem  2.3.1  for  the  Cauchy 
distribution.  We  have  for  A  >  0: 


f  A2 

J  <£mc(")  =  y, 


so  that  condition  (ii)  of  Theorem  2.3.1  is  clearly  satisfied,  and  can  be  used  in  a 
variance  expression  for  the  asymptotic  distribution  of  the  test  statistic. 
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Extreme- Value  expected  scores 

Note  that  the  Lehmann-alternative  model  (iv)  in  Section  2.2  is  implicitly  consid- 
ered here  by  our  discussion  of  extreme- value  score  functions  in  Section  4.2.  Straight- 
forward integration  yields 

N\      T(eA  +  t  -  1) 
C'~  (t-1)!    T(N  +  e*)     ~L 

If  eA  is  integer- valued,  we  get  the  further  simplification 

N{N-l)---(i) 

which  matches  the  result  obtained  by  Conover  &  Salsburg  (1988)  under  their 
Lehmann-type  mixed  Model  1. 

The  first  derivative  of  <^>mev(")  is 

<?mev(")  =  eA(eA  "  IK*"2, 

which  takes  on  strictly  positive  values  over  (0,1).   Thus  condition  (i)  of  Theorem 

2.3.1  is  satisfied.  The  expression  for 

i 

/^MEv(")d" 
0 

when  A  >  0  is 

e2A 

-1, 


2e*-l 
so  that  condition  (ii)  of  Theorem  2.3.1  also  holds. 

4.4     Censored  Data  Expected  Scores 

We  now  consider  evaluation  of  expected  scores  when  censored  data  is  present 
under  the  model  (2.2.4)  and  the  random  censorship  model  of  Section  2.2.    The 
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derived  density  form  g'{u)  at  (3.5.4)  in  the  previous  chapter  is  employed  for  both 
the  uncensored  and  censored  expected  scores. 

Recall  that  in  Theorem  3.3.1  there  were  three  conditions  given  that  are  sufficient 
for  the  approximate  uncensored  and  censored  scores  to  converge  in  probability  as 
N  — ►  oo  to  the  expected  scores  c,  and  C„  respectively.  It  was  pointed  out  in  the 
discussion  after  presentation  of  Theorem  3.3.1  that  condition  C-3  follows  from  the 
variance  expression  (3.4.11)  obtained  in  Theorem  3.4.3.  Condition  C-l  refers  to 
the  existence  and  continuity  of  the  first  and  second  derivatives  of  the  uncensored 
score  function  <f>(u)  over  (0,1).  Part  of  C-l  has  already  been  indirectly  addressed 
in  Section  4.3  when  the  monotonicity  condition  (i)  of  Theorem  2.3.3  was  discussed. 
In  order  to  investigate  the  existence  of  the  conditions  C-l  and  C-2,  the  derivatives 
<j>'{u)  and  4>"(u)  are  examined  for  each  of  our  chosen  distributional  forms  of  F(x-A). 

Another  key  point  that  is  investigated  below  is  the  boundedness  of  the  weight 
function  w(x)  at  (3.4.2).  Suppose  Ai  and  A2  are  positive  constants.  The  sufficient 
conditions  given  in  Lemma  3.4.1  are  equivalent  to  the  uncensored  score  function 
(f>(u)  satisfying 

lim  <f>(u)  =  A\\         lim  <f>(u)  =  A2, 

providing  us  with  another  means  to  check  these  conditions.  Lemma  3.4.1  and  the 
continuity  of  F(x  -  A)  were  sufficient  for  w(x)  to  be  bounded. 
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Normal  expected  scores 

By  nature  of  the  score  functions  <f>(ut)  and  $(«,),  taking  their  expectation  to  find 
c,  and  Ci%  respectively,  requires  numerical  integration  with  respect  to  the  density 

(3.5.4). 

Condition  C-2  of  Theorem  3.3.1  will  not  be  satisfied,  as  both  the  first  and  second 
derivatives  of  <f>(u)  are  unbounded  as  u  -,  1.  Of  course,  the  approximate  scores  are 
a  much  more  practical  choice  anyhow  for  the  test  statistic  from  a  computational 
standpoint. 

The  limit  of  the  weight  function  w{x)  at  (3.4.2)  as  x  -  -oo  is  -1,  but  the  limit 
as  x  -  oo  is  another  matter.  Both  score  functions  are  unbounded  as  x  grows  large 
or  equivalents,  as  «  -  1.  L'Hopital's  rule  cannot  be  applied,  as  all  derivatives  of 
f(x  :  A)  and  f(x)  grow  infinitely  large  in  the  limit  as  well. 

Hence  the  normal  distribution  is  not  directly  covered  by  Theorem  3.4.3.  Gill 
(1980)  encountered  a  similar  situation  for  the  location-shift  model  when  the  under- 
lying distribution  is  normal.  (Refer  to  his  Example  5.3.4,  p.  127-128.)  In  that  case, 
the  corresponding  weight  function  to  w*(x)  for  the  location-shift  model  was 

'  fix)  F(xY 
The  troublesome  point  is  x  =  -oo.  However,  his  (page  128-134)  proposals  deal 
with  this  in  order  to  successfully  derive  the  asymptotic  normality  properties  that 
would  be  expected  under  the  null  hypothesis.  The  proposals  can  be  adapted  for  our 
w(x)  for  the  mixed  model  at  (2.2.4).  The  reader  is  referred  to  Gill  (1980,  p.127-134) 
for  details. 
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Logistic  expected  scores 

Using  the  density  g'(ui)  at  (3.5.4),  an  uncensored  expected  score  will  involve 
the  integration 


1 


eA  +  (1  -  eA)u, 


du{ 


Ai-u,-)*'-1 

Jo 

=  (~)    B(l,R:)F(2,l;RJ  +  l;l-e-*) 


du{ 


1 


2-1 


"{->)B^V  +  (R,  +  i)-i 


(1-e"*) 


+ 


2-3-21 


;(l-02  + 


(^•  +  l)(i?i  +  2)-l-2' 


by  the  same  techniques  surrounding  the  hypergeometric  function  expounded  upon 
in  the  logistic  example  of  Section  4.3.  Thus  the  expected  score  is 


c,     =     e 


2  /    • 


3    n^ 


\j'-i 


"E- 


1=1 


n^ 


w'=i 


l,'    "V»{g(n8Bfc-^}-» 

i=i 
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=  in* 


U'=l 


Et 


g(*  +  'p)'W-1)'(1-«-Y}-i. 


(i?i  +  A)! 


Similarly,  the  censored  expected  score  turns  out  to  be 


Q  =  i(n*>)£- 


■f  *^LL|2!(I  _  £-*n  _  , 


U<=1        /  ,«l    jpj  (#,  _   D.\    U=0      (#i  +  ^)! 

1=1 

The  score  function  0ml(")  is  defined  and  continuous  over  [0,1].     Thus  w(x)  is 
bounded  for  all  x,  and  furthermore 

2(1  -eA) 


[eA  +  (1  -  e*)u]3 
6(1 -eA)2 


[eA  +  (1  _  eA)„]4 

so  that  conditions  C-l  and  C-^  of  Theorem  3.3.1  are  satisfied. 


Double-Exponential  expected  scores 


The  beta  function  is  involved  with  the  expectations. 
I.  If  0  <  Ui  <  1/2,  then 

a  =  e~A  -  1 


and 


g  -  u-^fii^ii:- — - — 5(2,^-1). 


i=i 


II.  If  1/2  <  ut  <  l-(l/2)e-A,then 


ft     = 


,-A 


C-J-)  urae- 


■B(1,Rj-2) 


1=1 


-1, 
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and 

1 


d  =   n^E- 


1=1 


e~A' 


B(2,Rj-l)-[  —  \B(ltRf-2) 


III.  If  1  -(l/2)e-A  <u, ,<  1,  then 

c,  =  Ci  =  eA  -  1. 

The  structure  for  the  regions  I  and  III  above,  along  with  the  monotonicity  of 
(f>(u)  immediately  insures  that  w(x)  is  bounded  and  that  conditions  C-l  and  C-2 
of  Theorem  3.3.1  obtain. 

Cauchy  expected  scores 

As  with  the  normal  distribution,  the  nature  of  the  score  functions  4>mc{u)  and 
$mc(u)  suggests  numerical  integration  for  the  expectation  over  (3.5.4). 
A  double  application  of  L'Hopital's  rule  for  evaluating  each  limit  gives 

lim  Z!iii|) .  um  £^>  =  I  =  1, 

so  that  iw(i)  =  0  at  the  points  — oo  and  oo.  Then  application  of  Lemma  3.4.1  is 
appropriate,  as  w(x)  is  bounded  for  all  x. 

The  first  and  second  derivatives  of  <f>(u)  are  messy  expressions,  but  indicate  that 
conditions  C-l  and  C-2  of  Theorem  3.3.1  are  satisfied. 

The  Cauchy  distribution  is  an  example  where  Theorem  3.4.3  is  more  generally 
applicable  than  Theorem  2.3.1  when  there  is  no  presence  of  censoring  in  the  data. 
Specifically,  the  monotonicity  condition  (i)  in  Theorem  2.3.1  of  the  score  function 
<Pmc{u)  is  relaxed. 
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Extreme- Value  expected  scores 

The  integration  produces  a  beta  function  again.  This  gives  the  expressions 

C    =    e'lflR^t-r-l B(e\R:)-l 

V'=i     /  i=i  n  (Ri  -  Rj) 


1=1 


and 


\J'ml         J  j=l    Yl  (ty  -  Nj) 


1=1 

1*1 


Clearly  <^mev(w)  is  bounded  on  [0, 1],  and  since  the  first  and  second  derivatives 
are  proportional  to  the  score  function,  conditions  C-l  and  C-2  hold.  Additionally, 
w(x)  is  bounded  on  the  extended  real  line  since 

,.       f(x  -  A)  f(x  -  A)        A 

lim  — ??~r_  =  °:       hm     //  \     = e  • 

r-.-oo         y(x)  *— °°         /(x) 


CHAPTER  5 
COMPARATIVE  STUDIES 


The  results  of  the  last  two  chapters  enable  us  to  evaluate  the  performance  of  lin- 
ear rank  test  statistics  derived  from  the  mixture  model  at  (2.2.4).  Since  the  scores 
of  these  linear  rank  statistics  depend  on  the  parameter  A,  Pitman  asymptotic  rel- 
ative efficiencies  can  be  used  to  help  chose  a  value  of  A  to  conduct  the  test.  In  the 
literature  review  of  Section  2.3,  an  example  of  the  Pitman  asymptotic  relative  effi- 
ciency for  misspecified  A  in  the  case  of  a  mixed  normal  distribution  was  reported. 
Included  in  that  example  was  another  efficiency  comparison  of  normal  scores  from 
the  standard  location-shift  model  relative  to  mixed-model  scores  when  the  underly- 
ing distribution  is  mixed  normal.  Now  we  can  extend  these  types  of  comparisons  to 
other  distributions  and  allow  for  censoring.  The  resulting  efficiencies  are  reported 
in  Section  5.1. 

Finite  sample  comparisons  involving  misspecified  A  in  the  mixed-model  scores 
and  comparisons  of  mixed-model  scores  vs.  location-shift  model  scores  arise  from 
simulations  that  were  conducted  and  are  discussed  in  Section  5.2.  A  variety  of  cen- 
soring mechanisms  and  underlying  distributional  forms  are  considered  to  evaluate 
the  performance  of  the  mixed-model  tests  and  the  location-shift  model  tests.  Some 
recommendations  on  scores  to  use  in  practice  are  presented  here.      Based  on  these 
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recommendations,  performance  of  selected  tests  on  an  uncensored  data  example  and 
a  censored  data  example  is  illustrated  in  Section  5.3. 

5.1     Pitman  Asymptotic  Relative  Efficiencies 

It  is  again  noted  that  ARE  is  an  abbreviation  for  Pitman  asymptotic  relative 
efficiency  in  this  manuscript.  The  usage  of  the  lone  word  "efficiency"  implicitly 
refers  to  Pitman  asymptotic  relative  efficiency. 

Recall  that  for  uncensored  data,  Theorem  2.3.2  gives  a  formula  to  be  used  to 
evaluate  ARE  for  some  particular  distributions  when  the  unknown  parameter  is 
misspecified.  This  was  reiterated  in  scenario  I  (3.6.4)  of  efficiency  expressions  in 
Section  3.6.  In  an  example  in  Section  2.3,  we  also  discussed  the  use  of  ARE  to 
compare  the  performance  of  the  mixed-model  scores  test  with  the  standard  location- 
shift  scores  test.  At  that  point,  the  normal  distribution  was  used  as  an  illustration. 

Because  the  efficiency  expressions  derived  in  Section  3.6  under  the  random  cen- 
sorship model  are  generalizations  of  the  uncensored  data  efficiency  expression  in 
Theorem  2.3.2,  the  ARE  evaluations  for  other  distributional  forms  that  consider 
both 

1.  No  censoring,  and 

2.  Various  censoring  patterns, 

will  be  reported  together.  To  summarize,  two  types  of  ARE  are  evaluated: 
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1.  Misspecified  A,  and 

2.  Misspecified  model. 

Description  of  censoring  patterns 

For  all  ARE  evaluations,  the  censoring  distributions  were  assumed  to  be  equal 
for  both  the  control  group  and  the  treatment  group.  A  formula  for  efficiency  is 
given  at  (3.6.9).  Under  random  censorship,  the  two  selected  censoring  distributions 
for  the  survival  times  were 

1.  The  distribution  of  the  natural  logarithm  of  an  exponential  random  variable 
with  hazard  rate  A,  i.  e.,  the  extreme-value  distribution 

L(x)  =  1  -  exp(x  -  log(l/A)  -  e^sO/A))).      _oo  <  x  <  oo,     A  >  0. 

2.  The  distribution  of  the  natural  logarithm  of  a  uniform  random  variable  with 
support  (0,  M),  i.  e., 

L(x)  =  1  -  —  ex;      -oo  <  x  <  logA/,      M  >  0. 

Values  of  the  parameters  A  and  M  can  be  selected  to  give  desired  censoring  rates 
for  the  control  and  treatment  groups  under  the  null  hypothesis  H0  :  7r  =  0.  For  our 
ARE  evaluations  below,  censoring  rates  of  10%,  25%  and  40%  were  selected. 

Including  the  uncensored  case,  i.e.  a  0%  censoring  rate,  ARE  evaluations  for 
seven  different  censoring  patterns  for  a  given  survival  distribution  were  computed 
for  the  misspecified  A  investigations  and  the  misspecified  model  investigations.  Se- 
lected ARE  evaluations  that  are  indicative  of  some  observed  patterns  are  reported 


120 

below.  For  uncensored  data,  a  closed  form  expression  of  both  types  of  ARE  was  al- 
ways obtained.  In  the  presence  of  censoring  however,  the  ARE  expressions  generally 
required  numerical  integration. 

Normal  distribution 

To  illustrate  the  full  range  of  ARE  evaluations  that  were  performed  consider 
Tables  5.1  and  5.2  below.  Recall  that  the  ARE  evaluations  for  uncensored  data  are 
given  in  Tables  2.1  and  2.2  in  Section  2.3  of  this  manuscript. 

In  Table  5.1,  At  refers  to  the  true  value  of  A  in  the  underlying  distribution,  while 
Ac/i  refers  to  the  chosen  value  in  the  scores  for  conducting  the  mixed-model  test.  In 
comparison  to  Table  2.1,  the  structure  of  Table  5.1  is  abbreviated  in  some  aspects. 
The  first  aspect  is  that  only  values  of  A  in  the  range  of  1.0-3.0  are  investigated, 
since  evidence  in  the  literature  and  research  done  for  this  dissertation  indicates 
that  the  mixed-model  tests  are  not  very  suitable  for  underlying  distributions  with 
A  <  1.0  values.  The  second  aspect  is  that  for  the  situation  where  At  =  A^,  the 
ARE  attains  the  maximum  value  of  1.0  for  all  censoring  patterns  (including  the 
uncensored  case);  so  this  place  is  omitted  for  the  table.  Lastly,  the  efficiency  values 
in  Table  5.1  are  ''symmetric"  in  the  sense  that  the  ARE  is  the  same  whether  the 
value  of  A  is  underspecified  or  overspecified.  Thus  the  blank  spaces  in  the  the  ith 
column  and  jth  row  can  be  filled  in  with  the  efficiency  value  from  the  jth  column 
and  ith  row. 

There  are  two  patterns  of  the  ARE  values  in  Table  5.1  that  appear  common  to 
all  the  distributions  that  are  selected  in  the  remainder  of  this  section. 
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Table  5.1:      Pitman  ARE  for  misspecified  A  under  a  mixed  normal  distribution  with 
exponential  and  uniform  (in  parentheses)  censoring  patterns 


icA 


Censoring  rate  is  10% 

A, 


1.5 


2.0 


2.5 


3.0 


1.0 

.891  (.916) 

.642  (.743) 

.397  (.580) 

.226  (.465) 

1.5 

.900  (.940) 

.680  (.824) 

.462  (.717) 

2.0 

.917  (.964) 

.736  (.897) 

2.5 

.935  (.981) 

Censoring  rate  is  25% 

Ac/l 

1.5 

2.0 

2.5 

3.0 

1.0 

.927  (.956) 

.761  (.871) 

.584  (.794) 

.441  (.744) 

1.5 

.940  (.975) 

.808  (.931) 

.666  (.894) 

2.0 

.955  (.988) 

.857  (.969) 

2.5 

.968  (.996) 
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Table  5.1:     continued 


Censoring  rate  is  40% 
A, 


Aca 

1.5 

2.0 

2.5 

3.0 

1.0 

.949  (.974) 

.836  (.928) 

.717  (.892) 

.622  (.872) 

1.5 

.962  (.988) 

.883  (.969) 

.462  (.957) 

2.0 

.975  (.996) 

.925  (.990) 

2.5 

.985  (.999) 

1.  As  the  censoring  rate  rises,  the  penalty  for  misspecifying  A  decreases.  (Com- 
parison of  Table  5.1  with  the  0%  censoring  rate  of  Table  2.1  indicates  this 
feature  as  well.) 

2.  The  ARE  values  under  uniform  censoring  are  at  least  as  large  as  the  ARE 
values  under  exponential  censoring. 

Notice  for  the  mixed  normal  distribution  that  the  ARE  values  under  uniform  cen- 
soring are  clearly  higher  than  the  ARE  values  under  exponential  censoring. 

Table  5.2  addresses  model  misspecification  by  displaying  ARE  values  of  using 
the  location-shift  model  test  instead  of  the  mixed-model  test.  The  location-shift 
model  test  can  be  regarded  as  a  censored  data  extension  of  the  normal  scores  test. 
As  in  Table  5.1,  the  feature  of  less  penalty  in  ARE  terms  as  the  censoring  rate 
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Table  5.2:  Pitman  ARE  for  location- model  vs.  mixed-model  linear  rank  test, 
mixed  normal  distribution  with  exponential  and  uniform  (in  paren- 
theses) censoring  patterns 


Censoring  rate  is  10% 
A  for  mixed-model  score 


1.0  1.5  2.0  2.5  3.0 

0.634  (0.650)     0.363  (0.405)     0.173  (0.236)     0.074  (0.143)     0.030  (0.095) 


Censoring  rate  is  25% 
A  for  mixed-model  score 


1.0  1.5  2.0  2.5  3.0 

0.693  (0.729)     0.466  (0.550)     0.292  (0.421)     0.180  (0.344)     0.117  (0.302) 


Censoring  rate  is  40% 
A  for  mixed-model  score 


1.0  1.5  2.0  2.5  3.0 

0.741  (0.789)     0.554  (0.658)     0.405  (0.567)     0.304  (0.515)     0.243  (0.490) 
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increases  is  evident.  A  comparison  of  Table  5.2  with  the  0%  censoring  rate  of  Table 
2.2  is  consistent  with  this  pattern  as  well. 

Logistic  distribution 

When  there  is  no  censoring,  evaluation  of  the  efficiency  expression  in  Theorem 
2.3.2  or  at  (3.6.4)  gives 

mA      A  ,      9eA'+^[2eA'+^(At  -  Aeh)  +  e2^  -  e2*f 
eff(A*,A0  -  (eAc,_eA,)6 

for  the  mixed-model  logistic  scores  and  the  misspecification  of  A  problem.  This 
last  expression  is  arrived  at  by  straightforward  but  tedious  algebra.  Substituting 
Ac/,   =  At  ±  c,  where  c  >  0,  the  last  efficiency  expression  for  uncensored  data 

becomes 

<r,A       ax       9e'cf[e2"cl  —  2|c|e'c'  —  l]2 

eff(Ac„A,)=  (eld -l)a ~' 

Thus  the  ARE  for  misspecified  A  only  depends  on  the  absolute  difference  between 
At  and  Ac/,,  and  not  on  the  actual  values  themselves.  Table  5.3  displays  the  ARE 
values  for  some  selected  values  of  \c\. 

The  efficiencies  for  misspecified  A  when  the  data  are  subject  to  censoring  do 
not  exhibit  the  same  feature  as  that  for  the  uncensored  data.  Table  5.4  contains 
ARE  values  for  a  25%  censoring  rate.  The  values  are  very  similar  under  exponential 
or  uniform  censoring  for  the  10%  and  40%  censoring  rates  as  well.  As  mentioned 
in  the  mixed-normal  example,  the  characteristic  of  the  penalty  of  misspecifying  A 
decreasing  as  the  censoring  rate  goes  up  also  occurs  for  the  mixed-logistic  disrtibu- 
tion. 
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Table  5.3:     Pitman  ARE  for  misspecified  A  under  a  mixed  logistic  distribution  with 
no  censoring 


\c\ 

ARE 

0.0 

1.000 

0.5 

0.963 

1.0 

0.862 

1.5 

0.720 

2.0 

0.565 

Table  5.4:     Pitman  ARE  for  misspecified  A  under  a  mixed  logistic  distribution  with 
exponential  or  uniform  (in  parentheses)  censoring  patterns 


Censoring  rate  is  25% 
At 


A^ 

1.5 

2.0 

2.5 

3.0 

1.0 

.967  (.976) 

.911  (.920) 

.828  (.853) 

.743  (.792) 

1.5 

.979  (.982) 

.926  (.941) 

.859  (.897) 

2.0 

.983  (.988) 

.942  (.962) 

2.5 

.987  (.993) 
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Table  5.5:  Pitman  ARE  for  location-model  vs.  mixed-model  linear  rank  test, 
mixed  logistic  distribution  with  absence  or  presence  of  exponential  and 
uniform  (in  parentheses)  censoring  patterns 


Censoring  rate  is  0%  (No  Censoring) 
A  for  mixed-model  score 


1.0 

1.5 

2.0 

2.5 

3.0 

0.862 

0.720 

0.565 

0.420 

0.297 

Censoring  rate  is  25% 
A  for  mixed-model  score 


1.0  1.5  2.0  2.5  3.0 

0.885  (0.887)     0.773  (0.780)     0.655  (0.673)     0.549  (0.582)     0.463  (0.514) 

Table  5.5  reports  results  for  model  misspecification.  The  location-shift  model 
test  is  the  Peto-Peto  generalization  of  the  Wilcoxon  rank  sum  test.  Table  5.5  indi- 
cates that  for  uncensored  data,  the  Wilcoxon  performs  relatively  well  to  the  mixed- 
model  test  for  values  of  A  up  to  1.5.  Again,  the  penalty  for  model  misspecification 
decreases  for  the  ARE  as  the  censoring  rate  increases.  The  ARE  values  are  similar 
under  exponential  or  uniform  censoring. 

Extreme- Value  distribution 

Table  5.6  displays  ARE  values  for  misspecified  A.  The  uncensored,  the  10% 
censoring  rate,  and  the  40%  censoring  rate  cases  exhibit  the  same  patterns  and 
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Table  5.6:     Pitman  ARE  for  misspecified  A  under  a  mixed  extreme-value  distribu- 
tion with  exponential  and  uniform  (in  parentheses)  censoring  patterns 


Censoring  rate  is  10% 
A, 


Ac/l 

1.5 

2.0 

2.5 

3.0 

1.0 

.927  (.929) 

.769  (.782) 

.604  (.640) 

.472  (.547) 

1.5 

.940  (.946) 

.808  (.838) 

.667  (.743) 

2.0 

.951  (.963) 

.844  (.897) 

2.5 

.962  (.980) 

are  thus  not  included.  In  Section  4.2,  we  mentioned  that  the  uncensored  score 
function  (pMEv(u)  was  derived  under  a  Lehmann-alternative  type  of  mixture  model 
by  Conover  and  Salsburg  (1988).  Their  paper  based  the  recommendation  of  scores 
with  a  =  5  or  equivalently,  A  =  log(5)  «  1.609,  on  efficiencies  computed  for  model 
misspecification.  The  location-shift  model  test  used  by  Conover  and  Salsburg  (1988) 
is  based  on  the  extreme- value  density  at  (4.2.2),  which  we  used  to  generate  <W(«) 
and  $mev(u).  This  location-shift  model  test  of  Conover  and  Salsburg  (1988)  consists 
of  "inverse  Savage  scores,"  as  opposed  to  the  Savage  (1956)  scores  that  comprise 
the  uncensored  data  version  of  the  log-rank  statistic. 

Since  the  log-rank  test  statistic  is  commonly  used  for  comparing  groups  with 
respect  to  survival  data,  Table  5.7  displays  ARE  values  for  comparing  the  log- 
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rank  test  to  the  mixed-model  test  based  on  ^mev(u)  and  $mev(u).  In  computing 
the  ARE  values,  the  underlying  survival  distribution  was  assumed  to  be  of  the 
form  (4.2.2).  It  is  noted  however,  that  although  values  are  not  displayed  here,  the 
calculations  with  the  assumption  of  the  extreme- value  density  form  at  (4.2.1)  leads 
to  very  similar  ARE  values.  The  maximum  ARE  values  for  model  misspecification 
appear  to  occur  for  values  of  A  between  1.5  and  2.0,  which  is  consistent  with  the 
findings  of  Conover  and  Salsburg  (1988).  Table  5.7  also  suggests  that  the  log-rank 
is  very  competitive  with  the  mixed-model  based  test  over  the  range  of  A  values 
1.0-3.0. 

Double-Exponential  and  Cauchy  distributions 

The  efficiency  values  for  misspecified  A  or  misspecified  model  exhibit  the  same 
patterns  common  to  the  previous  three  distributions,  which  are  again  summarized 
as  follows: 

1.  As  the  censoring  rate  increases,  the  penalty  for  misspecifying  A  decreases. 

2.  The  ARE  values  under  uniform  censoring  are  at  least  as  large  as  the  ARE 
values  under  exponential  censoring. 

5.2     Simulation  Study 

In  order  to  investigate  the  behavior  of  the  mixed-model  tests  for  fixed  sample 
sizes,  a  simulation  study  was  conducted.  The  results  of  Section  3.4  provide  the 
asymptotic  distribution  for  a  standardized  version  of  a  score  statistic  v  that  can  be 
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Table  5.7:  Pitman  ARE  for  location-model  vs.  mixed-model  linear  rank  test, 
mixed  extreme-value  distribution  with  absence  or  presence  of  expo- 
nential and  uniform  (in  parentheses)  censoring  patterns 


Censoring  rate  is  0%  (No  Censoring) 
A  for  mixed-model  score 


10  1.5  2.0  2.5  3.0 


0.846  0.922  0.912  0.838  0.737 


Censoring  rate  is  10% 
A  for  mixed-model  score 


L0  1-5  2.0  2.5  3.0 


0.925  (0.944)     0.973  (0.982)     0.931  (0.931)     0.840  (0.844)     0.737  (0.767) 


Censoring  rate  is  25% 
A  for  mixed-model  score 


!-0  .    1-5  2.0  2.5  3.0 


0.962  (0.981)     0.982  (0.984)     0.933  (0.934)     0.852  (0.892)     0.737  (0.878) 


Censoring  rate  is  40% 
A  for  mixed-model  score 


10  1-5  2.0  2.5  3.0 


0.981  (0.995)     0.983  (0.983)     0.933  (0.954)     0.880  (0.942)     0.848  (0.941) 
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used  to  test  the  null  hypothesis  H0  :  is  =  0  in  the  mixture  model  given  by  (2.2.4). 
The  simulations  in  this  section  enable  the  study  of  how  the  powers  of  tests  based 
on  scores  generated  from  different  forms  of  F(x  :  A)  compare  with  each  other  and 
compare  with  the  standard  censored  data  tests  that  do  not  assume  a  mixture  model 
but  instead  a  location-shift  model.  The  effect  of  various  censoring  distributions  on 
the  test  procedures  is  also  of  interest. 

Description  of  the  Simulation  study 

For  the  mixture  model  at  (2.2.4),  data  for  the  treatment  group  was  generated  by 
sampling  Ni  values  from  F(x),  also  sampling  iV"i  values  from  F(x  :  A),  and  choosing 
the  jth  observation  from  either  the  F(x)  sample  or  the  F(x  :  A)  sample  based  on 
the  outcome  of  a  0  or  1,  respectively,  from  an  independent  Bernoulli  random  variable 
R  with  probability  of  success 

7r  =  Pr(7?=  1), 

as  in  (2.2.1).  For  the  control  group,  an  independent  random  sample  of  N0  observa- 
tions from  F(x)  was  obtained. 

For  data  subject  to  random  right  censoring,  independent  random  samples  for  the 
treatment  group  and  the  control  group  were  generated  from  the  same  distribution 
characterized  by  L(x).  The  jth  observation  from  L(x)  in  a  group  was  matched 
up  with  the  jth  observation  from  (2.2.4)  as  described  in  the  previous  paragraph  in 
order  to  determine  whether  or  not  censoring  had  occurred. 


131 

The  significance  level  was  fixed  at  a  =  0.05  throughout  the  study.  Sample  sizes 

of  NQ  =  N:  =  20  and  N0  =  Nx  =  50  were  selected.   Three  survival  distributions 

for  the  logarithm  of  the  survival  time  were  chosen.    Note  that  all  have  support 

(-co,  oo). 

1.  The  normal  distribution,  characterized  by 

2.  The  Cauchy  distribution, 

F(x  :  A)  =  -  -  arctan(x  -  A). 

3.  The  extreme-value  distribution, 

F(x  :  A)  =  exp(-e(*-A)). 

These  three  choices  of  survival  distributions  enable  evaluation  of  performance  by 
the  tests  for  medium  and  heavy  tailed  distributions  as  well  as  a  distribution  with 
moderate  departure  from  symmetry. 

The  two  censoring  distributions  (as  in  Section  5.1)  selected  were: 

1.  The  distribution  of  the  natural  logarithm  of  an  exponential  random  variable 
with  hazard  rate  A,  i.  e.,  the  extreme-value  distribution 

L{x)  =  1  -  exp(x  -  log(l/A)  -  e(-MiA))).      _00  <  x  <  0Oj     X>Q 

2.  The  distribution  of  the  natural  logarithm  of  a  uniform  random  variable  with 
support  (0,  M),  i.  e., 

L^  =  l  ~  MgX'      -°°<x  <\°gM,     M>0. 
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These  two  censoring  distributions  have  been  employed  to  model  the  censoring  mech- 
anism on  data  in  a  clinical  trial  with  random  starting  times  for  subjects.  Values 
of  the  parameters  A  and  M  were  selected  in  the  study  to  give  censoring  rates  of 
10%,  25%,  and  40%  under  the  null  hypothesis  7r  =  0.  The  situation  of  no  censoring, 
i.e.  0%  censoring  rate,  is  also  considered.  Note  that  our  results  for  the  0%  censoring 
rate  extend  the  results  of  the  simulation  by  Johnson  et  al.  (1987). 

All  possible  combinations  of  n  =  0.1(0.1)0.3  and  A  =  1(1)3  were  considered  for 
parameter  values  in  the  mixture  model  at  (2.2.4).  Including  the  null  case  x  =  0, 
this  yielded  10  parameter  settings  for  each  possible  combination  of  survival  form 
F(x  :  A),  censoring  form  L(x),  and  censoring  rate.  A  configuration  in  our  study 
represents  a  combination  of 

1 .  A  value  of  ir 

2.  A  value  of  A 

3.  The  form  of  F(x  -  A) 

4.  The  form  of  L(x) 

5.  The  censoring  rate 

6.  The  sample  sizes  No  and  Nx. 

In  total,  360  configurations  to  generate  data  under  (2.2.4)  were  considered. 

Random  number  functions  in  S-plus,  Version  2.3  (Becker,  Chambers,  and  Wilks, 
1989;  Statistical  Sciences,  1990)  generated  the  data  as  described  above.  As  a  means 
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of  checking  the  validity  of  the  random  sample  generator  capabilities  of  S-plus,  we 
ploted  several  q-q  plots  for  our  samples.  Figures  5.1  and  5.2  show  two  such  plots- 
one  for  a  sample  from  a  normal  distribution  and  the  other  for  a  sample  from  an 
extreme-value  distribution.  The  tendency  of  the  data  points  to  fall  on  a  straight 
line  with  slope  unity  in  both  plots  is  evidence  that  the  intended  distribution  for  the 
data  does  hold. 

At  each  configuration,  care  was  taken  to  make  sure  that  a  different  seed  was 
used  to  start  the  random  number  generation  process.  After  a  sample  of  observations 
and  censoring  indicators  for  a  given  configuration  were  generated,  a  myriad  of  test 
statistics  were  computed  and  compared  to  the  standard  normal  critical  value  of 
1.96.  The  normal  approximation  (Theorem  3.4.3)  was  always  used.  If  the  value  of  a 
test  statistic  exceeded  1.96,  the  null  hypothesis  of  tt  =  0  was  rejected;  otherwise  the 
null  hypothesis  was  not  rejected.  Reference  to  a  "test  statistic"  for  the  remainder 
of  this  chapter  means  the  standardized  version  of  the  linear  rank  statistic. 

The  test  statistics  included  the  log-rank,  Gehan's  generalized  Wilcoxon  (1965), 
and  the  Peto-Peto  generalized  Wilcoxon  version  (1972).  When  the  underlying  sur- 
vival distribution  was  normal,  an  extended  version  of  the  normal  scores  test  statistic, 
that  uses  the  censored  score  functions  derived  from  the  methods  of  Prentice  (1978), 
was  also  computed.  Test  statistics  based  on  mixed-model  scores  derived  from  each 
of  the  normal,  logistic,  extreme-value,  double-exponential,  and  Cauchy  forms  of 
F(x  :  A)  were  also  computed.  Approximate  scores  that  consisted  of  substituting 
the  survival  function  estimate  Fp{x)  (see  3.3.2)  into  the  uncensored  and  censored 
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Figure  5.1:    1000  random  observations  from  a  normal  distribution 
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Figure  5.2:    1000  random  observations  from  an  extreme-value  distribut 
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score  functions  were  always  used.  The  variance  estimator  V2  was  always  used.  For 
configurations  with  no  censoring,  an  "exact"  expression  for  the  variance  could  have 
been  evaluated  (cf.  Theorem  2.3.1)  and  employed.  However,  since  such  a  variance 
expression  (cf.  (3.4.11)  in  Theorem  3.4.3)  generally  depends  on  the  censoring  distri- 
butions, the  estimator  V2  was  used  for  the  purposes  of  uniformity.  Since  the  scores 
depend  on  a  value  of  A,  scores  with  values  of  A  =  1,2,  and  3  were  calculated.  In 
summary,  the  following  statistics  were  calculated  for  each  configuration: 

•  log-rank,  denoted  by  LG 

•  Gehan,  GH  (or  W  for  Wilcoxon  when  no  censoring  occurs) 

•  Peto-Peto,  PP  (equals  GH  =  W  when  no  censoring  occurs) 

•  normal  scores,  NS  (only  calcualted  when  F  is  normal) 

•  mixed  normal  with  A  =  1,  denoted  by  MN(1) 

•  mixed  normal  with  A  =  2,  MN(2) 

•  mixed  normal  with  A  =  3,  MN(3) 

•  mixed  logistic  with  A  =  1,  ML(1) 

•  mixed  logistic  with  A  =  2,  ML(2) 

•  mixed  logistic  with  A  =  3,  ML(3) 

•  mixed  extreme- value  with  A  =  1,  MEV(l) 

•  mixed  extreme-value  with  A  =  2,  MEV(2) 
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•  mixed  extreme-value  with  A  =  3,  MEV(3) 

•  mixed  double-exponential  with  A  =  1,  MDE(l) 

•  mixed  double-exponential  with  A  =  2,  MDE(2) 

•  mixed  double-exponential  with  A  =  3,  MDE(3) 

•  mixed  Cauchy  with  A  =  1,  MC(1) 

•  mixed  Cauchy  with  A  =  2,  MC(2) 

•  mixed  Cauchy  with  A  =  3,  MC(3). 

A  total  of  1000  repetitions  were  run  for  each  configuration.  A  counter  of  the 
number  of  times  the  value  of  the  test  statistic  exceeded  the  critical  value  was  kept. 
The  value  of  the  counter  was  output  after  every  50  repetitions  to  permit  a  check 
that  the  random  number  generator  was  not  repeating  the  same  cycle  of  values. 

The  only  direct  comparison  with  similar  simulation  studies  in  the  literature 
is  with  Table  4  of  Johnson  et  al.  (1987).  Neither  Good  (1979)  nor  Conover  and 
Salsburg  (1988)  report  simulation  results  in  their  papers,  although  the  latter  indi- 
cate "good  agreement"  of  empirical  and  nominal  significance  levels  of  their  derived 
tests  used  with  an  ad-hoc  t-distribution  approximation.  Boos  and  Brownie  (1986) 
present  some  simulation  results  but  use  values  of  w  =  0.6,0.8,  or  1.0  to  generate 
nonresponders  data  with  no  censoring. 

In  the  following,  the  results  of  our  simulation  study  will  be  presented  in  three 
parts.  In  the  first  part  we  will  compare  our  uncensored  data  results  of  Johnson  et 
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al.  (1987)  in  those  cases  where  such  comparison  is  possible.  In  the  second  part,  we 
will  prsent  an  overall  discussion  of  our  results  in  the  uncensored  data  case,  while 
the  third  part  will  be  reserved  for  a  discussion  of  censored  data  results. 

Comparison  with  Johnson  et  al.  (1987)  results 

Tables  5.1  and  5.2  display  a  comparison  of  results  from  common  configurations 
between  our  simulations  and  some  of  the  simulation  figures  reported  in  Table  4  of 
Johnson  et  al.  (1987).  Recall  that  the  statistics  GH,  PP  and  W  are  the  same  here 
because  there  is  no  censoring.  Features  in  common  for  the  both  simulations  include: 

1.  There  is  no  censoring. 

2.  The  underlying  distribution  g(x  :  n,  A)  is  mixed  normal. 

3.  A  large  sample  approximation  is  used. 

4.  Powers  of  the  tests  were  estimated  using  1000  trials 

5.  Sample  sizes  of  N0  =  Nt  =  20  were  used. 

6.  All  possible  combinations  of  parameter  values  A  =  1(1)3  and  x  =  0.1(0.1)0.3, 
as  well  as  the  null  tt  =  0  case  were  considered. 

The  differences  in  features  are: 

1.  Johnson  et  al.  (1987)  use  a  different  variance  estimator  of  v. 

2.  Sample  sizes  of  N0  =  Nt  =  50  were  employed  in  our  simulation,  whereas 
Johnson  et  al.  (1987)  used  N0  =  ^  =  40. 
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3.  The  significance  levels  were  estimated  from  6000  trials  in  Johnson  et  al.  (1987); 
our  estimated  levels  were  based  on  1000  trials. 

Table  5.8  gives  the  most  direct  comparison  between  the  simulation  of  Johnson 
et  al.  (1987)  and  ours  because  NQ  =  Nx  =  20.  In  our  study,  the  ability  of  a  test 
statistic  to  hold  the  nominal  level  of  a  =  0.05  was  ascertained  on  the  basis  of  the 
interval  0.05  ±  2  ■  SE,  where 

c„        /(0.05)(0.95) 

If  the  empirical  sizes  fell  outside  the  interval  of 

(.036,  .064), 

there  is  evidence  that  the  corresponding  test  statistic  does  not  hold  the  nominal 
value  of  a  =  0.05.  It  should  also  be  kept  in  mind,  however,  that  the  chosen  sample 
sizes  may  not  be  large  enough  to  insure  asymptotic  normality  of  a  given  test  statistic. 
With  this  criterion,  none  of  the  estimated  sizes  in  Table  5.8  significantly  differ  from 
0.05,  although  MN(2)  is  close. 

For  the  alternative  configurations,  the  most  striking  pattern  is  that  the  MN(2) 
test  in  our  simulation  consistently  has  higher  power  than  the  version  of  MN(2)  in 
Johnson  et  al.  (1987).  There  is  also  a  pattern  of  the  MN(1),  W,  and  NS  statistics 
having  empirical  powers  clearly  above  Johnson  et  al.  (1987)  when  tt  =  0.1  or  0.2  and 
the  underlying  distribution  has  A  =  3.  However,  both  the  NS  and  W  test  statistics 
appear  in  agreement  with  their  Johnson  et  al.  (1987)  counterparts  when  jr  =  0.3. 
The  difference  in  variance  estimators  when  N0  =  Nt  =  20  for  our  test  procedures 


140 


Table  5.8:      Empirical  powers  under  a  mixed  normal  distribution  and  no  censoring, 
with  group  sample  sizes  of  20. 


Test 


w 


NS 


MN(1) 


MN(2) 


N0  =  7Vt  =  20 

7T  =  0 

7T=0.1                                  7T  =  0.2 

A                                A 

7T  =  0-3 
A 


Size 


.039 
(.055) 


1 


.040 
(.055) 


.041 
(.057) 


.063 
(.049) 


.100      .107     .142 
(.099)  (.112)  (.113) 


.112     .118     .166 

(.099)  (.134)  (.137) 


.122     .176     .265 
(.103)  (.170)  (.215) 


.148     .228     .367 


1 


.131     .236     .312 
(.159)  (.227)  (.253) 


1 


.143     .271     .353 
(.163)  (.247)  (.292) 


182     .383     .600 
(.177)  (.343)  (.471) 


.215     .447     .701 


.233     .404     .502 
(.241)  (.431)  (.491) 


.245     .442     .556 
(.224)  (.458)  (.542) 


.278     .614     .827 
(.245)  (.606)  (.755) 


.287     .667     .885 


(.097)  (.163)  (.234)|(.156)  (.337)  (.519)  (.208)  (.574)  (.804) 


All  the  numbers  in  parentheses  are  taken  from  Table  4  of  Johnson  et  al.  (1987). 
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as  compared  with  Johnson  et  al.  (1987)  could  be  a  reason  for  the  disparities  when 
larger  values  of  A  for  either  the  underlying  distribution  or  the  mixed  normal  scores 
are  chosen.  It  is  interesting  that  for  the  W,  NS,  and  MN(1)  statistics,  our  empirical 
levels  were  lower  than  Johnson  et  al.  (1987),  while  under  the  alternative  configura- 
tions, our  versions  tended  to  have  higher  power  than  those  versions  of  Johnson  et 
al.  (1987). 

Table  5.9  contrasts  our  results  for  mixed  normal  distributions  with  no  censoring 
and  sample  sizes  NQ  =  Nt  =  50  with  the  corresponding  results  of  Johnson  et 
al.  (1987)  for  sample  sizes  of  N0  =  AT,  =  40.  The  sizes  for  W,  NS,  and  MN(1) 
are  in  better  agreement  than  in  Table  5.8,  perhaps  due  to  better  agreement  of  our 
variance  estimates  and  the  variance  estimates  of  Johnson  et  al.  (1987)  for  larger 
sample  sizes.  The  results  are  similar  to  Johnson  et  al.  (1987)  except  that  our  test 
statistic  versions  again  seem  to  have  slightly  elevated  power,  perhaps  partly  due  to 
the  difference  in  sample  sizes.  Our  MN(2)  statistic  appears  liberal  by  the  criterion 
of  significantly  exceeding  the  nominal  level  of  0.05. 

Simulation  Results  in  the  Uncensored  data  case 

The  complete  range  of  results  of  our  simulation  study  on  the  test  statistics 
when  there  is  no  censoring  of  the  data  is  now  discussed.  Recall  the  three  types 
of  underlying  distributions  selected— normal,  Cauchy,  and  extreme-value.  In  the 
following  we  present  some  typical  results  of  the  patterns  that  were  observed  from 
uncensored  data  in  the  simulation  study.  A  listing  of  these  patterns  for  uncensored 
data  is: 
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Table  5.9:      Empirical  powers  under  a  mixed  normal  distribution  and  no  censoring, 
with  group  sample  sizes  of  50  (40). 


N0  =  N1=  50     (N0  =  Nx  =  40) 

7T  =  0                 7T  =  0.1                              7T  =  0.2                              tt-0.3 

AAA 

Test 

Size 

1          2          3 

1          2          3 

1          2         3 

W 

.052 

.114     .166     .221 

.215     .452     .515 

.389     .724     .814 

(.058) 

(.110)  (.170)  (.171) 

(.202)  (.360)  (.460) 

(.329)  (.661)  (.775) 

NS 

.046 

.126     .197     .285 

.234     .517     .599 

.420     .770     .874 

(.056) 

(.114)  (.194)  (.206) 

(.201)  (.399)  (.524) 

(.340)  (.689)  (.828) 

MN(1) 

.056 

.160     .351     .548 

.269     .762     .925 

.464     .940     .994 

(.055) 

(.131)  (.282)  (.368) 

(.249)  (.609)  (.819) 

(.392)  (.870)  (.969) 

MN(2) 

.080 
(.049) 

.184     .413     .721 
(.108)  (.290)  (.461) 

.264     .789     .964 
(.216)  (.626)  (.887) 

.428     .927     .996 
(.344)  (.950)  (.988) 

All  the  numbers  in  parentheses  are  taken  from  Table  4  of  Johnson  et  al.  (1987). 
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1.  As  the  chosen  A  value  for  a  mixed-model  score  gets  larger,  the  power  of  the 
corresponding  test  also  gets  larger. 

2.  For  group  sample  sizes  of  50,  all  mixed-model  score  tests  hold  nominal  signifi- 
cance levels  fairly  well  except  for  the  mixed-normal  based  tests  with  statistics 
MN(2)  and  MN(3). 

3.  When  group  sample  sizes  are  20,  the  mixed-model  score  tests  based  on  A  =  1 
or  A  =  2  hold  the  nominal  a  =  0.05  except  for  the  normal  case  MN(2). 

4.  The  ML(2),  MEV(2),  and  MN(1)  consistently  had  high  powers  over  the  con- 
figurations studied. 

We  now  look  at  the  existence  of  these  patterns  for  some  example  configurations  in 
our  simulation  study. 

For  purposes  of  illustration,  Table  5.10  displays  the  complete  set  of  figures  ob- 
tained from  a  particular  combination  of 

•  Sample  size:  NQ  =  iVj  =  50 

•  Survival  distribution:  extreme-value 

•  Censoring  distribution:  None  (or  technically,  L(x)  =  1  for  all  x) 

•  Censoring  rate:  0%. 

Recall  from  Section  4.2  that  the  score  functions  in  the  MEV(  )  statistics  in  Table 
5.10  are  not  derived  from  the  extreme- value  form  of  the  underlying  distribution, 
i.e.  the  logarithm  of  an  exponential  random  variable.  Rather,  the  scores  are  derived 
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Table  5.10:    Empirical  powers  under  a  mixed  extreme- value  distribution  and  no  cen- 
soring, with  group  sample  sizes  of  50. 


N0  =  N1=  50 

7T  =  0  T  =0.1  7T  =  0.2 

A  A 


7T  =  0.3 

A 


Test 

Size 

1       2       3 

1       2       3 

1       2       3 

LG 

.042 

.165  .305  .417 

.306  .686  .842 

.539  .930  .974 

GH 

.036 

.107  .143  .197 

.194  .406  .508 

.350  .632  .754 

PP 

.036 

.107  .143  .197 

.194  .406  .508 

.350  .632  .754 

MN(1) 

.054 

.173  .386  .533 

.353  .790  .920 

.604  .966  .992 

MN(2) 

.069 

.207  .541  .710 

.414  .894  .982 

.642  .995  .998 

MN(31 

.084 

.223  .589  .764 

.433  .912  .985 

.687  .992  .999 

ML(1) 

.035 

.138  .245  .319 

.261  .580  .742 

.478  .872  .947 

ML(2) 

.050 

.160  .349  .486 

.329  .756  .906 

.570  .961  .991 

MUZ) 

.050 

.179  .465  .631 

.379  .853  .966 

.612  .987  .996 

MEV(l) 

.035 

.118  .184  .239 

.227  .477  .609 

.397  .753  .856 

MEV(2) 

.044 

.147  .311  .428 

.304  .706  .853 

.537  .940  .989 

MEV(3) 

.052 

.168  .442  .612 

.366  .843  .959 

.598  .988  .996 

MDE(l) 

.034 

.122  .194  .242 

.223  .482  .613 

.373  .765  .979 

MDE(2) 

.045 

.149  .329  .449 

.316  .746  .902 

.524  .960  .992 

MDE(3) 

.058 

.189  .527  .689 

.390  .882  .978 

.601  .991  .998 

MC(1) 

.049 

.081  .087  .079 

.121  .176  .224 

.182  .365  .504 

MC(2) 

.043 

.078  .092  .108 

.143  .259  .390 

.204  .555  .739 

MC(3) 

.050 

.093  .123  .147 

.158  .356  .568 

253  .700  .897 
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from  an  extreme- value  form  with  a  density  achieved  by  taking  the  logarithm  of  the 
inverse  of  an  exponential  random  variable.  Hence,  the  MEV(  )  statistics  will  not 
necessarily  tend  to  perform  the  best  out  of  the  mixed-model  tests  in  terms  of  power 
and  maintaining  a  =  0.05.  But  the  similarity  of  the  extreme-value  forms  likely 
allows  the  good  consistent  performance  of  the  MEV(  )  family  of  test  statistics. 

Because  there  are  no  censored  observations,  the  GH  and  PP  statistics  reduce  to 
W,  the  Wilcoxon  Rank  Sum,  and  thus  should  have  identical  empirical  power.  Also, 
the  figures  in  Table  5.10  generally  indicate  that  as  the  value  of  A  in  the  mixed-model 
scores  gets  larger,  the  power  of  the  test  statistic  also  tends  to  get  larger,  regardless  of 
what  the  true  value  of  A  is  in  the  underlying  survival  distribution.  The  asymptotic 
relative  efficiencies  displayed  in  Section  5.1  indicated  that  power  is  lost  according 
to  the  degree  that  the  value  of  A  chosen  in  the  scores  is  misspecified  relative  to  the 
true  value  of  A  in  the  underlying  distribution.  However,  the  efficiencies  are  local, 
i.e.  are  interpreted  in  terms  of  nN  -»  0  as  N  -+  oo.    The  lack  of  a  power  loss  for 
misspecified  A  featured  in  Table  5.10  and  in  other  simulations  conducted  in  our 
study  is  not  unique— Table  4  of  Johnson  et  al.  (1987)  exhibits  the  same  feature  to 
some  degree  as  well. 

Most  of  the  test  statistics  in  Table  5.10  have  significance  levels  that  lie  within 
two  standard  errors  of  0.05.  The  MN(2)  and  MN(3)  empirical  significance  levels 
fall  well  above  two  standard  errors  away  from  0.05,  thereby  suggesting  that  the  test 
associated  with  these  statistics  tend  to  be  liberal.  The  MN(2)  and  MN(3)  statistics 
do  display  the  highest  powers  for  detecting  the  various  alternative  cases. 
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Among  those  test  statistics  that  maintain  the  nominal  significance  levels,  MN(1), 
ML(3),  MEV(3),  and  MDE(3)  have  superior  power.  The  power  values  of  the  stan- 
dard LG  test  statistic  gain  ground  as  tt  gets  larger  but  the  GH=PP=W  statistic 
performs  poorly  throughout.  The  statistic  MDE(3)  has  the  highest  power  values  for 
the  most  alternative  settings,  while  ML(3)  had  the  second  highest.  The  ML(3)  sig- 
nificance level  does  have  an  edge  (0.05)  over  the  MDE(3)  significance  level  (0.058). 
Also  note  that  the  power  values  of  MEV(3)  were  continually  close  behind  those  of 
ML(3).  The  ML(2)  and  MEV(2)  statistics  do  fairly  well  in  comparison  to  ML(3) 
and  MEV(3). 

When  N0  =  Ni  =  20  for  the  extreme-value/no  censoring  configuration,  the 
mixed  model  test  statistics  with  scores  based  on  A  =  2  or  3  consistently  have  high 
significance  levels.  The  test  statistics  using  A  =  3  in  their  scores  all  had  significance 
levels  of  0.071  or  higher,  and  thus  were  not  included  in  Table  5.11.  The  MDE(2) 
statistic,  with  a  significance  level  of  0.058,  possesses  the  higher  power  figures  out 
of  those  listed.  Note  that  the  LG  performs  better  than  the  ML(1)  and  MEV(l) 
statistics  in  terms  of  power. 

The  tendency  of  liberal  tests  and  increased  power  also  showed  up  in  simulations 
when  the  underlying  survival  distribution  was  mixed  normal.  Tables  5.8  and  5.9 
provided  us  with  a  subset  of  the  results  that  was  previously  discussed.  In  comparison 
with  the  other  mixed-model  tests  and  the  LG  and  NS  statistics,  the  MN(1)  did  well 
in  terms  of  maintaining  the  significance  level  and  detecting  alternative  values  of  jr, 
when  the  group  samples  sizes  were  20  or  50. 
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Table  5.11:    Empirical  powers  under  a  mixed  extreme- value  distribution  and  no  cen- 
soring, with  group  sample  sizes  of  20. 


7T  =  0 

jV0  =  ATi  =  20 

7T  =  0.1                 7T  =  0.2                 tt  =  0.3 

AAA 

Test 

Size 

1       2       3 

1       2       3 

1       2       3 

LG 

.053 

.094  .161  .223 

.163  .332  .444 

.257  .576  .706 

MN(1) 

.062 

.111  .193  .261 

.178  .401  .523 

.297  .660  .800 

ML(1) 
ML(2) 

.049 
.059 

.081  .142  .192 
.109  .191  .254 

.150  .297  .401 
.172  .405  .527 

.234  .520  .656 
.286  .659  .808 

MEV(l) 
MEV(2) 

.052 
.059 

.079  .110  .156 
.099  .171  .233 

.142  .250  .320 
.162  .378  .488 

.201  .414  .544 
.275  .630  .783 

MDE(2) 

.058 

.124  .209  .282 

.176  .441  .592 

.295  .704  .857 
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The  powers  of  all  the  tests  included  in  our  simulations  were  low  for  a  heavy  tailed 
Cauchy  survival  distribution.  Figures  are  not  displayed  for  the  sake  of  brevity.  The 
MC(1)  and  MC(2)  consistently  had  the  best  power  values  relative  to  the  other 
test  statistics.  Among  these  others,  the  MDE(l)  test  statistic  performed  the  best. 
There  was  a  sharp  dropoff  to  the  power  figures  for  MDE(2),  however.  There  was  a 
general  tendency  for  powers  of  all  the  mixed-model  test  statistics  to  decline  for  larger 
choices  of  A  in  the  scores,  which  is  contrary  to  the  tendency  when  the  underlying 
distribution  was  normal  or  extreme-value. 

Simulation  Results  in  the  Censored  data  case 

As  in  the  uncensored  data  simulation  results,  we  present  some  typical  results  of 
the  patterns  that  were  observed  from  the  several  configurations  of  censored  data  in 
the  simulation  study.  The  patterns  observed  for  censored  data  include: 

1.  Among  the  standard  location-shift  model  tests,  the  log-rank  (LG)  performs 
well  relative  to  the  mixed-model  tests  for  smaller  values  of  A  (e.g.  A  =  1) 
in  the  underlying  distribution  or  when  the  censoring  rate  becomes  heavy.  It 
virtually  always  has  greater  power  than  the  Gehan  (GH)  and  Peto-Peto  (PP) 
statistics. 

2.  Mixed-model  score  tests  consistently  have  better  power  than  the  log-rank  test 
when  the  underlying  distribution  has  the  settings  A  =  2  or  A  =  3. 

3.  When  the  censoring  rate  is  small  to  moderate  (e.g.  0-25%)  and  A  >  1,  the 
ML(2),  MEV(2),  and  MN(1)  are  best  among  the  mixed-model  tests  studied 
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in  the  sense  of  similar  power  and  holding  nominal  significance  levels  over  the 
wide  variety  of  censoring  and  survival  distributions  considered. 

4.  For  heavy-tailed  distributions  and  small  sample  sizes  all  the  tests  generally 
exceed  the  nominal  level  a  =  0.05.  The  mixed-model  tests  based  on  Cauchy 
scores  are  generally  weak  in  power  and  are  challenged  by  the  GH  and  PP 
statistics  in  limited  instances  where  the  normal  approximation  of  the  distri- 
bution of  the  test  statistic  appears  adequate. 

The  following  examples  from  our  simulation  study  illustrate  these  last  remarks. 

The  reported  censoring  rate  determined  by  the  parameters  A  and  M  in  the 
censoring  distributions  strictly  applies  to  the  null  hypothesis  case  of  equal  survival 
distributions  for  the  control  and  treatment  groups.  Under  alternative  hypotheses, 
the  rate  will  be  higher  in  the  treatment  group,  since  the  censoring  distribution  is 
common  for  the  two  groups  in  all  the  simulations  conducted  for  this  dissertation. 

As  with  the  uncensored  data  results,  we  begin  with  the  extreme-value  form  for 
the  underlying  survival  distribution.  Results  with  iV0  =  AT,  =  50  for  the  mild 
censoring  rate  of  10%  and  both  exponential  and  uniform  censoring  distributions 
are  given  in  Table  5.12.  The  MDE(3),  MN(2),  and  MN(3)  statistics  (results  not 
displayed)  had  significance  levels  of  0.065  or  greater.  For  the  rest  of  the  statistics 
under  consideration,  the  ML(3)  and  MEV(3)  statistics  clearly  had  the  highest  pow- 
ers, with  ML(2)  and  MEV(2)  doing  well  in  comparison.  There  appear  to  be  no 
marked  differences  for  the  powers  due  to  whether  the  censoring  distribution  form 
was  exponential  or  uniform. 
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Table  5.12:  Empirical  powers  under  a  mixed  extreme- value  distribution  and  10% 
exponential  and  uniform  (in  parentheses)  censoring  rates,  with  group 
sample  sizes  of  50. 


N0  =  Ni=  50 

Exponential  and  Unifor 

7T  =  0                ff  =  0.1 

m  Censoring  Distributions 

7T  =  0.2                             T  =  0.3 

AAA 

Test 

Size 

1          2          3 

1           2          3 

1          2          3 

LG 

.053 

.160     .303     .386 

.293     .615     .772 

.449     .873     .958 

(.054) 

(.140)  (.292)  (.391) 

(.290)  (.647)  (.830) 

(.455)  (.900)  (.966) 

MN(1) 

.058 

.183     .351     .455 

.323     .678     .834 

.483     .909     .982 

(.056) 

(.162)  (.333)  (.464) 

(.304)  (.702)  (.876) 

(.492)  (.930)  (.985) 

ML(2) 

.054 

.170     .326     .438 

.307     .669     .829 

.475     .906     .978 

(.054) 

(.153)  (.310)  (.455) 

(.296)  (.690)  (.873) 

(.487)  (.921)  (.982) 

ML(3) 

.059 

.187     .373     .512 

.328     .733     .883 

.503     .931     .987 

(.058) 

(.174)  (.367)  (.513) 

(.311)  (.732)  (.901) 

(.499)  (.929)  (.988) 

MEV(2) 

.055 

.161     .306     .392 

.285     .641     .807 

.450     .886     .970 

(.051) 

(.145)  (.298)  (.412) 

(.281)  (.667)  (.848) 

(.474)  (.907)  (.974) 

MEV(3) 

.051 

.176     .355     .506 

.313     .727     .884 

.477     .926     .984 

(.056) 

(.170)  (.361)  (.510) 

(.292)  (.725)  (.903) 

(.479)  (.930)  (.986) 

151 
Table  5.13  indicates  a  less  clear  picture  when  the  censoring  rate  is  a  moderate 
25%.  The  ML(3)  and  MEV(3)  statistics  have  high  significance  levels  under  uniform 
censoring,  but  not  under  exponential  censoring,  where  interestingly,  ML(2)  and 
MEV(2)  have  higher  significance  levels.  The  MN(1)  and  MN(2)  statistics  behave 
similarly.  The  LG,  MN(1),  ML(2),  MEV(2),  and  MDE(2)  are  preferable  since  they 
maintain  a  =  0.05  and  have  comparable  power. 

The  effect  of  a  uniform  censoring  distribution  is  more  pronounced  for  a  censor- 
ing rate  of  40%.  All  of  the  test  statistics  in  the  study  had  estimated  significance 
levels  of  0.064  or  higher  under  uniform  censoring,  with  most  of  them  falling  three 
standard  errors  or  more  above  0.05.  On  the  other  hand,  all  of  the  test  statistics 
had  significance  values  of  0.063  or  less  under  exponential  censoring,  that  is,  values 
within  two  standard  errors  of  0.05.  Results  are  not  displayed  here,  but  again,  LG, 
MN(1),  ML(2),  MEV(2),  and  MDE(2)  were  most  appealing  in  terms  of  balancing 
good  power  under  alternative  parameter  settings  along  with  maintenance  of  the 
nominal  a  =  0.05. 

Results  for  an  underlying  extreme-value  distribution,  N0  =  Ni  =  20,  and  the 
presence  of  censoring  are  not  displayed,  but  in  summary,  for  exponential  censoring, 
the  MN(1),  ML(2),  MEV(2),  and  MDE(2)  statistics  all  performed  the  best  for  both 
the  10%  and  25%  censoring  rates.  The  LG  statistic  again  was  a  worthwhile  competi- 
tor, but  the  power  figures  were  not  as  nearly  close  to  the  mixed-model  counterparts 
as  in  the  uniform  censoring  case.  Under  uniform  censoring,  only  test  statistics  hav- 
ing mixed-model  scores  with  A  =  1  maintained  reasonable  significance  levels  for 
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Table  5.13:  Empirical  powers  under  a  mixed  extreme- value  distribution  and  25% 
exponential  and  uniform  (in  parentheses)  censoring  rates,  with  group 
sample  sizes  of  50. 


Np  =  Nr  =  50 
Exponential  and  Uniform  Censoring  Distributions 

2E  =  0  7T  =  0-1  7T  =  0.2  7T  =  0.3 


Test 


LG 


MN(1) 

MN(2) 


ML(2) 
ML(3) 


MEV(2) 
MEV(3) 


MDE(2) 
MDE(3) 


Size 


.051 
(.048) 


.058 
(.061) 

.053 
(.067) 


.055 
(.058) 

.051 

(.062) 


.057 
(.060) 

.052 
(.065) 


1 


152     .218     .225 
(.129)  (.230)  (.269) 


1 


147     .245     .248 
(.142)  (.238)  (.290) 

.160     .278     .272 
(.142)  (.253)  (.304) 


.156     .241     .249 
(.135)  (.233)  (.288) 

156     .269     .266 
(.145)  (.245)  (.301) 


.226     .498      .576 
(.275)  (.479)  (.587) 


.245     .513  .621 

(.276)  (.501)  (.618) 

.257     .537  .638 

(.272)  (.508)  (.626) 


1 


.377     .746     .845 
(.407)  (.740)  (.847) 


.052 
(.057) 

.057 
(.066) 


.153     .231     .238 
(.136)  (.221)  (.276) 

159     .265     .258 
(.139)  (.236)  (.302) 


158     .249     .246 
(.133)  (.221)  (.296) 

161     .284     .272 
(.147)  (.246)  (.308) 


234     .510      .613 

(.278)  (.504)  (.624) 

246     .527      .627 


399     .749     .869 
(.407)  (.760)  (.857) 

.388     .768     .882 
(.369)  (.731)  (.826) 


.398     .751     .862 
(.398)  (.765)  (.856) 

392     .758     .881 


(.267)  (.515)  (.632)  (.366)  (.736)  (.834) 


.220     .497      .603 
(.267)  (.501)  (.615) 

241     .521      .622 
(.256)  (.507)  (.610  ) 


229     .506  .612 

(.256)  (.505)  (.618) 

.247     .532  .628 

(.263)  (.496)  (.610) 


393     .743     .858 
(.385)  (.755)  (.851) 

387     .754     .873 
(.354)  (.724)  (.814) 


379     .741  .870 

(.367)  (.742)  (.834) 

.379     .762  .877 

(.350)  (.711)  (.816) 
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both  the  10%  and  25%  censoring  rates.  The  LG  statistic  had  comparably  high 
values  around  0.06.  Powers  under  alternative  parameter  settings  were  similar  for 
MN(1),  ML(1),  MEV(l),  and  MDE(l).  The  LG  statistic  compared  very  well  with 
these  mixed-model  tests.  A  simulation  with  40%  censoring  rate  was  not  attempted 
for  sample  sizes  of  20,  since  the  figures  from  the  lower  censoring  rates  indicated 
that  the  asymptotic  distributions  of  the  test  statistics  were  suspect  under  uniform 
censoring. 

Dependence  on  the  censoring  distribution  was  also  apparent  for  the  performance 
of  the  mixed-model  test  statistics  when  the  underlying  survival  distribution  was 
normal.  Again  for  a  10%  censoring  rate  and  N0  =  Ni  =  50,  the  MN(2)  and  MN(3) 
statistics  had  significance  levels  ranging  from  0.066  to  0.093.   The  results  for  the 
10%  censoring  rate  and  N0  =  JV,  =  50  are  not  given  here.   The  MN(1)  statistic 
performed  best  under  exponential  or  uniform  censoring,  with  ML(2)  close  behind. 
Table  5.14  contains  results  from  the  N0  =  Nx   =  20  and  10%  censoring  rate 
simulation  runs.    The  censored  data  extension  of  the  normal  scores  statistic,  NS, 
is  included  since  the  underlying  distribution  is  normal.     Note  however  that  the 
LG  statistic  always  has  greater  power  in  Table  5.14  than  the  NS  statistic  while 
maintaining  the  nominal  a   =  0.05.     The  MN(2)  statistic  again  appears  liberal 
under  exponential  censoring.  For  those  tests  maintaining  significance  levels  at  the 
nominal  a  =  0.05,  the  MN(1)  has  the  highest  power  values  most  frequently,  with 
ML(2)  second  in  frequency  of  high  power  values. 
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Table  5.14:  Empirical  powers  under  a  mixed  normal  distribution  with  10%  expo- 
nential and  uniform  (in  parentheses)  censoring  rates,  with  group  sample 
sizes  of  20. 


N0  =  Nl=  20 

Exponential  and  Uniform  Censoring  Distrib 

utions 

t  =  0.3 

7T  =  0                      7T  =  0.1                                       7T  =  0.2 

AAA 

Test 

Size 

1           2          3 

1          2          3 

1          2         3 

LG 

.056 

.094      .155     .220 

.177     .343     .472 

.265     .575     .716 

(.056) 

(.097)  (.194)  (.240) 

(.165)  (.375)  (.503) 

(.261)  (.558)  (.738) 

NS 

.048 

.085      .129     .170 

.142     .248     .345 

.236     .461     .588 

(.055) 

(.095)  (.165)  (.193) 

(.153)  (.284)  (.394) 

(.233)  (.467)  (.617) 

MN(1) 

.058 

.102      .170     .260 

.186     .378     .519 

.265     .606     .745 

(.057) 

(.108)  (.204)  (.260) 

(.176)  (.409)  (.547) 

(.271)  (.582)  (.777) 

MN(2) 

.079 

.122      .210     .317 

.207     .411     .569 

.280     .609     .772 

(.060) 

(.124)  (.231)  (.298) 

(.190)  (.443)  (.583) 

(.278)  (.593)  (.804) 

ML(2) 

.056 

.099      .174     .257 

.183     .377     .531 

.265     .608     .751 

(.060) 

(  .109)  (.208)  (.261) 

(.168)  (.409)  (.552) 

(.264)  (.587)  (.781) 
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For  the  25%  censoring  situation,  similar  patterns  emerged  for  the  N0  =  Ni  = 
50  sample  sizes.  MN(1)  and  ML(2)  were  among  the  statistics  with  high  powers 
and  acceptable  significance  levels.  The  MEV(2)  and  LG  were  the  other  statistics 
with  high  power.  As  with  extreme-value  distribution  results,  the  higher  censoring 
rate  ruled  out  more  of  the  mixed-model  tests  on  the  basis  of  failure  to  maintain 
significance  levels.  Even  more  were  ruled  out  when  the  N0  =  Ni  =  20  simulations 
were  run,  including  LG,  MN(1),  ML(2)  and  MEV(2).  Of  the  remaining  tests  with 
significance  levels  within  two  standard  errors  of  0.05  under  both  exponential  and 
uniform  censoring,  MEV(l)  consistently  had  higher  powers  than  the  GH,  PP,  and 
NS  statistics. 

Results  of  the  40%  censoring  rate  with  N0  =  Nx  =  50  are  contained  in  Table  5.15. 
An  interesting  aspect  is  the  presence  of  low  but  acceptable  significance  levels  under 
exponential  censoring.  The  statistics  MN(1),  ML(2),  and  MEV(2)  are  preferable 
under  exponential  censoring,  while  the  choice  of  A  =  1  or  2  in  ML(  )  and  MEV(  ) 
makes  little  difference  under  uniform  censoring.  The  LG  statistic  performs  well 
relative  to  the  mixed-model  tests,  particularly  under  uniform  censoring. 

Finally,  when  the  underlying  survival  was  Cauchy,  the  significance  levels  for  all 
the  tests  were  quite  high,  except  for  the  GH,  PP  and  MC(  )  statistics  in  limited 
instances,  where  power  tended  to  be  very  low  as  well.  Inadequate  sample  size  is  a 
possible  reason.  Results  are  not  displayed  here. 

In  summation,  the  mixed-model  tests  based  on  MN(1),  ML(2),  and  MEV(2)  are 
recommended  for  use  on  small  to  moderately  censored  data  when  the  nonresponders 
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Table  5.15:  Empirical  powers  under  a  mixed  normal  distribution  and  40%  exponen- 
tial and  uniform  (in  parentheses)  censoring  rates,  with  group  sample 
sizes  of  50. 


W0  =  Nj  =  20 
Exponential  and  Uniform  Censoring  Distributions 

S  =  0  7T  =  0.1  7T  =  0.2  7T  =  0.3 


Test 


LG 


MN(1) 

MN(2) 


ML(1) 
ML(2) 


MEV(l) 
MEV(2) 


Size 


.041 
(.056) 


.038 
(.058) 

.054 
f.062) 


.042 
(.060) 

.043 
(.062) 


.039 
(.063) 

.047 
(.062 


1 


.095     .168 
(.132)  f.195) 


.198 
(.227) 


1 


.097     .174 
(.128)  (.202) 

097     .193 
f.125)  (.200) 


.213 

(.224) 

.252 

(.208) 


.185 
(.235) 


.382     .480 
(.419)  (.456) 


.097     .170 
(.129)  (.198) 

.098     .181 


.198 
(.228) 

.213 

(.128)  (.207)  (.214) 


100  .160 

(.121)  (.197) 

095  .182 

127)  (.209) 


.184 
(.23) 

.215 

(.211) 


.189 
(.235) 

.185 
(.238) 


.395     .502 
(.410)  (.453) 

.377     .508 
(.378)  (.407) 


.290 
(.337) 


.191 
(.234) 

.189 
(.233) 


.377     .466 
(.423)  (.460) 

.397     .510 
(.406)  (.444) 


.293 
(.338) 

.299 


.621     .749 
(.608)  (.707) 

.633     .759 
(.617)  (.698) 

.613     .752 
(.577)  (.669) 


.190 
(.228) 

195 

(.231) 


.366     .429 
(.404)  (.442) 

.389     .493 
(.402)  (.443) 


.294     .618     .745 
(.338)  (.610)  (.706) 

.289     .626     .761 
(.337)  (.613)  (.697) 


294 
(.326) 

284 
(.328) 


.582     .711 
(.607)  (.692) 

.620     .753 
(.608)  (.689) 
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model  at  (2.2.4)  is  reasonable.  They  will  be  most  effective  in  detecting  a  treatment 
effect  for  values  of  A  >  1  in  the  underlying  survival  distribution  of  the  responders. 
Otherwise,  the  log-rank  statistic  is  preferrable  when  there  is  heavy  censoring  and 
A  <  1  in  the  survival  distribution  of  the  responders. 

5.3     Real  Data  Examples 

The  recommended  tests  of  the  previous  section  are  applied  to  two  examples  of 
real  data  thought  to  follow  the  nonresponders  scenario.  The  first  example  consists 
of  uncensored  data,  while  the  second  example  has  a  small  amount  of  censored  ob- 
servations. The  test  statistics  that  are  applied  include  the  log-rank  (LG),  MN(1), 
MEV(2),  and  ML(2).  Also  included  is  the  Peto-Peto  generalized  Wilcoxon  (PP) 
as  an  illustration  of  its  poor  performance  for  nonresponders  data  governed  by  the 
mixture  model  (2.2.4). 

Uncensored  Data  Example 

The  data  in  Table  5.16  are  taken  from  Conover  and  Salsburg  (1988,  Table  1). 
The  change  in  pain  on  some  analog  scale  is  measured  before  and  after  four  weeks  in 
this  acute  painful  diabetic  neuropathy  study.  The  values  in  Table  5.16  are  actually 

/baseline  measure \ 
\    final  measure    J  ' 

where  final  measure  refers  to  the  recorded  pain  value  after  four  weeks.  The  control 
subjects  received  a  placebo. 
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Table  5.16:   Change  in  pain  measures  from  diabetic  neuropathy  study 


Control  Subjects  (JV0  =  30) 


-1.490  -0.201  -0.128  -0.087  -0.054  0.000  0.000  0.000  0.000  0.000 
0.028  0.039  0.049  0.061  0.080  0.105  0.134  0.193  0.216  0.223 
0.273      0.288      0.330      0.357      0.487      0.541      0.793     1.042     1.099     1.609 


Treatment  Subjects  (N%  =  28) 


-1.535  -0.547  -0.201  -0.201  -0.154  -0.095  -0.049  0.000  0.000  0.000 
0.105  0.111  0.201  0.251  0.310  0.406  0.511  0.531  0.575  0.575 
0.773      0.981      1.299      1.299      1.322      1.386      1.792     2.398 


Source:  Conover  and  Salsburg  (1988,  Table  1) 

Values  represent  the  logarithm  of  the  ratio  of  baseline  to  final  measurements. 
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Table  5.17:   Comparison  of  test  statistic  values,  uncensored  data  example 


Test  Statistic 

cs 

Ours 

MN(1) 

1.728 

1.778 

MEV(l) 

1.911 

1.892 

W 

0.981 

0.993 

CS:  Conover-Salsburg 


Conover  and  Salsburg  (1988)  compared  results  from  the  calculations  of  the 
Wilcoxon  rank  sum  test,  Student's  t-test,  Good's  (1979)  test,  the  quantile  test 
of  Johnson  et  al.  (1987),  MN(1),  and  four  other  tests  based  on  scores  (sx(  )-s4(  )) 
in  their  notation)  derived  in  their  paper.  The  scores  s2  are  identical  to  our  MEV(  ) 
scores  with  A  =  log(5)  m  1.609.  The  test  procedure  of  Conover  and  Salsburg  (1988) 
use  a  variance  estimator  based  on  the  sum  of  squared  deviations  from  the  average 
of  the  N  scores  (e.g.  Randies  and  Wolfe,  1979)  to  standardize  v,  and  employ  a 
t-distribution  with  N  -2  degrees  of  freedom  to  compute  the  p-value.  For  samples 
sizes  like  those  in  this  example,  a  t-distribution  approximation  will  be  very  close 
to  the  standard  normal  distribution.   Table  5.17  lists  the  test  statistics  values  we 
computed  with  the  variance  estimator  V2  (3.4.16)  as  compared  to  the  Conover  and 
Salsburg  (1988)  versions.    Note  that  the  observed  values  of  the  test  statistics  are 
similar  in  each  row. 
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Table  5.18:   Test  statistic  values  and  computed  p- values,  uncensored  data  example 


Test  Statistic 

MN(  ) 

ML() 

MEV(  ) 

LG 

W 

1.0  1.609 


1.778  [0.038]  —  _ 

1.672  [0.047]  1.834  [0.033]  1.857  [0.032] 

1.426  [0.077]  1.892  [0.029]  1.931  [0.027] 

1.695  [0.045]  1.695  [0.045]  1.695  [0.045] 

0.993  [0.160]  0.993  [0.160]  0.993  [0.160] 


The  p-values  are  in  [  ]. 


From  our  simulation  results  in  Section  5.1,  appropriate  test  statistics  to  compute 
for  this  type  of  data  would  be  MN(1),  ML(2),  and  MEV(2).  Table  5.18  contains 
results  from  these  last  three  test  statistics  as  well  as  the  Wilcoxon,  log-rank  (LG), 
and  results  from  the  mixed-model  statistics  with  scores  using  A  =  1  or  A  =  log(5)  » 
1.609.  The  corresponding  p-values  are  given  in  brackets.  The  mixed-model  tests 
with  ML(  )  or  MEV(  )  in  Table  5.18  show  a  trend  of  lower  p-values  as  A  grows 
larger.  The  values  for  the  log-rank  test  are  constant,  of  course,  since  the  test  statistic 
does  not  depend  on  A.  Recall  that  for  uncensored  data,  the  log-rank  reduces  to 
the  exponential  scores  test  of  Savage  (1956).  The  p- value  of  0.045  for  LG  is  very 
comparable  to  those  p-values  of  other  statistics,  including  the  tests  of  Conover  and 
Salsburg  (1988),  who  did  not  include  it  (LG)  on  grounds  of  "losing  power"  when  A 
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tends  away  from  0.  Conover  and  Salsburg  (1988)  instead  recommend  an  "inverse 
Savage"  score  denoted  by  s3(  ).  The  test  statistic  based  on  the  these  inverse  Savage 
scores  s3(  )  yielded  an  observed  test  statistic  value  of  1.677  with  p- value  0.050  from 
the  data. 

Figure  5.3  is  plot  of  test  statistic  behavior  over  the  range  of  values  A  6  [1,2J. 
The  ML(  ),  MEV(  ),  LG,  W,  and  MN(1)  statistics  are  included.  Observe  that  for 
both  the  ML(  )  and  MEV(  )  statistics,  the  lowest  p-value  (or  equivalently  by  1:1 
properties,  the  highest  test  statistic  value)  occurs  at  A  =  2. 

Censored  Data  Example 

The  Veteran's  Administration  (VA)  lung  cancer  clinical  trial  (Prentice,  1973; 

Kalbfleisch  and  Prentice,  1980)  consisted  of  137  males  with  inoperable  lung  cancer 

that  were  randomized  to  either  the  standard  or  a  test  chemotherapy.    Analysis 

(Prentice,  1973;  Kalbfleisch  and  Prentice,  1980)  of  this  data  has  taken  into  account 

several  covariates  believed  to  have  prognostic  effects  on  the  response  of  interest, 

time  to  death.  Nine  observations  of  the  response  were  censored.  The  Kalbfleisch  k 

Prentice  (1980)  analyses  fit  several  regression  models  with  the  covariates  to  study 

treatment  differences.  One  of  the  covariates  indicated  whether  a  patient  in  the  trial 

had  received  any  prior  therapy.  This  covariate  was  not  found  to  have  a  significant 

effect.  There  was  also  no  evidence  of  significantly  improved  survival  time  for  patients 

receiving  the  test  therapy  instead  of  the  standard  therapy. 

A  complete  listing  of  the  data  can  be  found  in  Appendix  I  of  Kalbfleisch  and 
Prentice  (1980).  A  subset  of  the  data  from  patients  that  received  the  test  therapy  is 
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Figure  5.3:   Graph  of  test  statistic  behavior,  uncensored  data  example 
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displayed  in  Table  5.19.  This  subset  of  data  is  divided  into  two  groups-control  for 
those  patients  not  receiving  any  prior  therapy,  and  "treatment"  for  those  patients 
receiving  prior  therapy.  The  existence  of  other  covariates  such  as  cancer  cell  type, 
which  has  been  found  to  be  a  significant  factor  on  survival  time  in  regression  models, 
may  be  responsible  for  only  a  small  proportion  of  subjects  in  the  treatment  group 
responding  with  better  survival  times.  There  are  only  three  censoring  times  among 


Table  5.19:   Survival  times  in  days  from  Veteran's  Administration  lung  cancer  trial, 
patients  receiving  test  therapy 


Control  (No  Prior  Therapy)  Patients  (N0  =  49) 


1  7  7  8  8  13  15  19  21      24 

24  25  25  25  29  30  31  33  36      45 

48  49  51  52  52  53  61  73  80      80 

83*  87  87*  99  99  111  111  112  133 

140  186  242  283  357  378  389  467  587 


Treatment  (Prior  Therapy)  Patients  (Nt  =  19) 


1         2        15       18        19       20       43       44      51      84 
90      103     164     201     231*     231     340     991     999 


Source:  Kalbfleisch  and  Prentice  (1980,  Appendix  I) 
J:  Censored  times 


the  N  -  68  observations,  yielding  an  approximately  5%  censoring  rate.   The  test 
statistic  results  are  displayed  in  Table  5.20. 
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Table  5.20:   P-values  for  VA  lung  cancer  censored  data  example 


One-Sided     Two-Sided 
Test  Statistic       P-value  P-value 


None  of  the  tests  in  Table  5.20  strongly  indicate  a  treatment  effect,  although 
MN(1)  provides  marginal  evidence  near  the  0.05  significance  level  for  a  one-sided 
test  and  near  the  0.10  level  for  a  two-sided  test.  Notice  that  all  three  of  the  mixed- 
model  test  statistics:  MN(1),  ML(2),  and  MEV(2),  outperform  the  log-rank  test 
statistic. 


CHAPTER  6 
SUMMARY  AND  CONCLUSIONS 


The  "nonresponders"  problem  arises  in  the  study  of  a  treatment  where  the  sub- 
jects receiving  the  treatment  may  or  may  not  be  affected.    If  a  proportion  of  the 
subjects  receiving  treatment  are  not  affected,  then  outcome  data  from  these  non- 
responders follow  the  same  distribution  as  control  subjects  that  did  not  receive  the 
treatment.  A  measured  outcome  for  subjects  in  the  treatment  group  is  governed  by 
a  mixture  model  described  in  Section  2.2  and  presented  at  (2.2.4)  in  this  manuscript. 
The  mixture  model  at  (2.2.4)  has  been  employed  previously  in  the  statistical 
literature  for  uncensored  data.  In  this  dissertation,  the  potential  of  a  random  cen- 
soring mechanism  on  the  outcome  data  is  considered.  Since  the  aspect  of  censoring 
arises  in  the  context  of  survival  data,  nonparametric  tests  are  frequently  employed 
to  compare  groups  on  the  basis  of  stochastic  ordering  for  the  outcome  of  interest. 
Because  it  is  assumed  that  a  treatment  effect  results  in  larger  outcome  values  for 
the  treated  patients  relative  to  the  control  patients,  a  linear  rank  test  statistic  v 
that  is  specifically  designed  to  detect  a  treatment  effect  when  nonresponders  are 
present  can  be  generated  from  the  mixture  model  at  (2.2.4).   The  test  based  on  v 
can  also  handle  the  situation  of  stochastically  smaller  values  in  the  treatment  group; 
see  Section  3.6  for  details. 
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Because  v  is  designed  to  specifically  test  the  hypothesis  of  no  treatment  effect 
versus  the  alternative  hypothesis  that  there  are  a  nonzero  proportion  of  responders 
7T  in  the  treatment  group,  the  scores  that  make  up  v  depend  on  the  parameter  A 
that  represents  a  measure  of  the  change  in  outcome  for  responders.  Scores  also 
depend  on  a  form  for  the  underlying  distribution.  The  asymptotic  properties  for 
a  standardized  v  established  in  Chapter  Three  provide  means  to  conduct  tests  of 
significance  and  study  efficiencies  of  the  tests  under  alternative  hypotheses. 

Pitman  asymptotic  relative  efficiencies  and  simulation  studies  showed  that  to 
detect  a  treatment  effect  in  the  presence  of  nonresponders,  the  greater  ability  of 
mixed-model  tests  (based  on  v)  over  the  location-model  tests  such  as  the  log-rank 
or  a  generalized  Wilcoxon  is  most  apparent  for  larger  values  of  A  and  smaller  values 
of  7T  in  the  underlying  distribution  for  the  treatment  group.  Tests  based  on  scores 
derived  from  the  logistic  and  extreme-value  distributions  with  a  chosen  value  of 
A  =  2,  called  ML(2)  and  MEV(2),  respectively,  along  with  a  test  based  on  scores 
from  the  normal  distribution  with  a  chosen  value  of  A  =  1,  i.  e.  MN(1),  performed 
consistently  well  over  a  wide  range  of  data  configurations.    These  configurations 
(described  in  Section  5.2)  were  constructed  from  varying  sample  sizes,  censoring 
distributions  and  censoring  rates,  and  survival  distributions.  As  the  censoring  rate 
becomes  heavy  (40%),  the  log-rank  test  performs  as  well  as  the  recommended  mixed- 
model  tests  ML(2),  MEV(2),  and  MN(1).  Applications  to  studies  with  uncensored 
data  or  censored  data  affirmed  that  the  ML(2),  MEV(2),  and  MN(1)  tests  provide 
the  best  results,  with  the  log-rank  a  worthy  competitor. 
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All  the  practical  applications  in  this  manuscript  relied  on  asymptotic  theory  for 
the  distribution  of  v.  Further  research  on  exact  procedures  may  prove  interesting 
and  useful.  The  calculation  of  expected  scores  and  a  permutation  distribution  for 
a  given  set  of  data  may  provide  more  appropriate  results,  particularly  for  smaller 
{No  =  jV,  <  20)  sample  sizes.  Implementation  of  the  form  for  evaluating  expected 
scores  that  is  given  in  Section  3.5  appears  to  involve  considerable  computer  pro- 
gramming at  this  point. 
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